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Abstract 

Measurement of the spatial distribution of neutral hydrogen via the redshiftcd 21 cm line 
promises to revolutionize our knowledge of the epoch of reionization and the first galaxies, and 
may provide a powerful new tool for observational cosmology from redshifts 1 < z < 4. In this 
review we discuss recent advances in our theoretical understanding of the epoch of reionization 
(EoR), the application of 21 cm tomography to cosmology and measurements of the dark energy 
equation of state after reionization, and the instrumentation and observational techniques shared 
by 21 cm EoR and post reionization cosmology machines. We place particular emphasis on the 
expected signal and observational capabilities of first generation 21 cm fluctuation instruments. 
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1 Evolution of HI in the Universe 

Three major stages in the evolution of our universe are written in the phases of 
hydrogen. After nucleosynthesis the universe was an ionized plasma of hydro- 
gen and helium. As expansion cooled the universe, hydrogen went through a 
phase transition, rapidly becoming neutral and releasing the Cosmic Microwave 
Background light at a redshift of ~1089. High energy photons produced by the 
first stars and quasars later reionized the hydrogen in the inter galactic medium 
(IGM), forcing the universe back through a second extended and patchy phase 
transition referred to as the epoch of reionization (EoR). 

The epoch of reionization was an important milestone in the history of the uni- 
verse for two reasons. First, reionization identifies the epoch when astrophysical 
sources produced of order one photon per baryon, and so became the dominant 
influence on the conditions in the intergalactic medium. Turning this around, 
the study of reionization provides an opportunity to study the properties of the 
first galaxies and stars. Since the ionization and temperature of the medium 
play a significant regulatory role in galaxy formation for all subsequent galaxy 
evolution, reionization must be understood as part of a complete theory of galaxy 
formation at any cosmic epoch. 

The redshift of the ionized to neutral phase transition is well measured by the 



cosmic microwave background (CMB, Komatsu et al. (2009)), but the redshift 



range of the Epoch of Reionization is significantly less certain. The absence of 
Gunn-Peterson absorption troughs in the spectra of high redshift quasars indi- 
cates that the universe was reionized by a redshift of six and has been highly 



ionized for the subsequent 12 billion years (see the review by Fan, Carilli &; 



Keating 20061. At z > 6 interpretation of the Lyman-a quasar absorption data 
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becomes more uncertain. 

Recent advances in radio instrumentation and techniques will soon make it 
possible to observe the 21 cm emission of the neutral hydrogen itself. For the 
EoR this would enable detailed studies of structure formation and the formation 
of the first galaxies. After reionization, the neutral hydrogen would trace the 
locations of galaxies and through the power spectrum provide a very promising 
way of measuring the expansion history of space over redshifts of 1 < z < 4. 

In this review we concentrate on the astrophysics and cosmology which will 
be enabled by the first generation EoR and post reionization 21 cm fluctuation 
experiments. This review should be seen as an installment in a series of reviews by 
different authors ( Barkana Sz Loeb|2001 Fan, Carilli Sz Keating|2006 Furlanetto 



Oh Sz Briggs 20061. In particular, the excellent and encyclopedic EoR review 



by Furlanetto, Oh Sz Briggs (20061 goes into many details we do not have the 
space to cover here. In this review we will concentrate on three areas: recent 
advances in our theoretical understanding of what the EoR may look like, the 
application of 21 cm tomography to cosmology observations after reionization, 
and the instrumentation and observational requirements shared by both EoR 
and post- reionization HI machines. In our numerical examples, we adopt the 



currently standard set of cosmological parameters (Komatsu et al. 



2009 



1.1 Current understanding of HI 

1.1.1 HYDROGEN DURING THE REIONIZATION ERA The discovery of distant 
quasars over the last decade has facilitated detailed Lya absorption studies of 
the state of the high redshift IGM at a time when the universe was as little as 



^ nm = 0.24, fib ~ 0.04 and fig = 0.76 for the matter, baryon, and dark energy fractional 
density respectively, h = 0.73 for the dimensionless Hubble constant, and as = 0.81 for the 
variance of the linear density field within regions of radius 8/i~^Mpc. 
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a billion years old (Fan et al. 2006). Unfortunately, Lya absorption can only 
be used to probe volume averaged neutral fractions that are smaller than 10^'^ 
owing to the large cross-section of the Lya resonance. As a result, studies of 
the ionization state of the mean IGM using Lya absorption become inconclusive 



in the era of interest for reionization (e.g. Bolton &; Haehnelt 2007a Lidz et al. 
2007b[ [Becker, Ranch Sargent pOOT] ). 



On the other hand Bolton Sz Haehnelt (2007b I have shown that the observed 
ionization rate at z ~ 6 implies an emissivity that is only just sufficient to have 
reionized the universe by that time, indicating that the reionization of the IGM 
may have been photon starved. Similarly, the small escape fractions found for 



high redshift galaxies by several studies (e.g. Chen, Prochaska &: Gnedin 20071, 
together with the star formation rates implied by the observed high redshift 
galaxy population suggests a photon budget that struggles to have been sufficient 
to reionize the universe by z ~ 6 ( Gnedin||2008 |. If true, these results imply that 
while the IGM seems to be highly ionized along the lines-of-sight towards the 
highest redshift quasars, the reionization epoch should not have occurred at a 
redshift substantially higher than z ~ 7 — 8. 

Thomson scattering of CMB photons provides a complimentary constraint to 
the Lya forest (and also to related studies such as those using Lya emitters, e.g. 



Ota et al. 2008). The Thomson scattering optical depth due to free electrons 



in the IGM probes the integrated ionization along the line-of-sight between the 
observer and the surface of last scattering, and thus probes the whole history 
over which the universe was reionized. The optical depth can therefore be used to 
estimate the redshift at which reionization was substantially underway, although 
it provides only a single number to describe the integrated reionization history 
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and so is degenerate among a wide range of reionization scenarios. Assuming an 

instantaneous redshift for reionization, measurements from the WMAP satellite 

yield Zreion = 10.9 ± 1.4 ( jKomatsu et"al]|2009| ) . 



Several authors have considered the constraints available on the reionization 
history based on a combination of existing observations. For example Choudhury| 



Sz Ferrara (20061 and Pritchard, Loeb Sz Wyithe (20091 have modeled the reion- 



ization history of hydrogen and compared predictions to a range of observables, 
using results for specific physical models and generic parameterised models re- 
spectively. The consistent message from modeling of the reionization history is 
that reionization must have been completed by z > 6.5, but could not have been 
completed at a redshift substantially higher than z ~ 7 — 8 given the value of 
optical depth to electron scattering. An example reionization history showing the 
evolution of the mean mass weighted neutral fraction (designated xm throughout 
this review) in a model that is consistent with current constraints is shown in 
Figure [T] for illustration. 

These studies illustrate the limited utility of current observational probes of the 
IGM for studying the EoR in detail. Lya absorption studies become ineffective 
as reionization is approached due to the high optical depth of the absorption 
line, and can only be studied along a few sight lines due to the rarity of bright 
quasars. The CMB optical depth currently offers our most robust constraints, 
but it is restricted to being an integral, sky averaged measure (though the Planck 



satellite may be able to trace some details of the reionization history. Holder et al. 



2003). What is required to further advance our knowledge of reionization and 
the first galaxies is measurement of the ionization state of the IGM as a function 
of both time and space. Such an opportunity is provided by tomography of the 
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redshifted 21 cm emission line (e.g. Tozzi et al. 20001. 



1.1.2 HYDROGEN FOLLOWING THE COMPLETION OF REIONIZATION Follow- 
ing the completion of reionization HI was confined to over-dense regions of the 
IGM where the enhanced recombination rate maintains a non-zero fraction of 
hydrogen in atomic form. The situation therefore differs between low and high 
density regions of the universe which contain optically thin Lya absorbers and 
self-shielded damped Lya systems (DLAs) respectively. The ionization fraction 
in the low density regime is controlled by the balance between the ionization 
rate owing to the UV background and the recombination rate at the local gas 



density (e.g. Bolton et al. 20051. In contrast, DLAs are known to be dense and 



self-shielded (Wolfe, Gawiser Sz Prochaska 20051. Although little is understood 



about the nature of DLAs, it is known that they play a very important role 
in the evolution of hydrogen, containing ~ 80% of the HI in the universe at 



z < 4 (Prochaska, Herbert- Fort & Wolfe 2005 1 . Their clustering (Cooke et al. 



2006) and abundance (Zwaan et al. 2005), as well as theoretical expectations 



(Nagamine et al. 2007) suggest that DLAs are housed within galactic mass sys- 



tems. The upper right hand panel of Figure [T] illustrates the mean evolution of 
the mass and volume weighted HI content of the universe following the end of 
reionization, compared with predictions of an analytic model. Most of the HI 
content is in dense clumps, and so does not contribute to the Lya optical depth. 

1.1.3 The 21 cm brightness temperature The 21 cm line, correspond- 
ing to the ground state hyperfine transition of atomic hydrogen, has a long history 
in galactic and extra galactic astronomy. Studies have focused on the inter-stellar 
medium, either of our own galaxy or of other galaxies. However recent develop- 
ments in low frequency radio arrays are making it possible to study diffuse HI at 
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high redshift. 

The fluctuations in 21 cm intensity (or brightness temperature) from difl'er- 
ent regions of the IGM include contributions from a range of different physical 
properties, including density, velocity gradients, gas temperature, gas spin tem- 



perature and ionization state (Furlanetto, Oh Sz Briggs 20061. Quantitatively. 



the 21 cm brightness temperature contrast may be written 



1 



AT = 23.8 ( ^ j ' [1 - Xi{l + 4)] (1 + 6,) 



Ts — Tqmb 



mK, (1) 



where Xi is the mean ionization fraction (xj = 1— xhi), S is the dark matter density 
fluctuation, 5x is the ionization fraction fluctuation, 5y = {1 + z)H"^dvr/dr is 
the velocity distortion sourced fluctuation (with dvr/dr being the gradient of 
the peculiar velocity along the line-of-sight with Hubble parameter H)^ and Tg 
and TcMB are the spin temperature of the HI and the temperature of the CMB 
background radiation respectively. 

The evolution of the ionized fraction Xi{z) from zero to one should produce a 
soft ~ 20 mK 'step' in the spectrum as a function of frequency towards the end of 
the EoR (Figure[T] lower left panel). While the global step (e.g. Shaver et al.|1999" 



Bowman, Rogers Hewitt|2008l ) may provide a mean redshift of reionization and 



constrain the very high redshift spin temperature evolution (Pritchard &: Loeb 



2008), observations of the 21 cm fluctuations (which are comparable in amplitude 
to the mean signal; lower right panel in Figure [T| have the potential to unravel 
the processes behind the EoR and probe cosmology. 

In this review we concentrate on the theory and observations associated with 
21 cm fluctuations, and restrict our attention predominantly to those predictions 
for the 21 cm signal of reionization that are most applicable to low frequency 



Cosmic 21 cm Fluctuations 9 
arrays which are either in planning or under construction. The fluctuations and 

evolution of the kinetic and spin temperatures of HI are frontier issues in theo- 



retical studies of the EoR (e.g. Pritchard &: Furlanetto 2007 Pritchard &: Loeb 



20081. However the first generation of EoR instruments will have good signal to 
noise only during reionization, and as a result we have focused our attention on 
analyses that assume the spin temperature of hydrogen is coupled to the kinetic 
temperature of an IGM that has been heated well above the CMB temperature 
(i.e. Ts ^ Tcmb)- This condition should hold during the later stages of the 



reionization era {z 9 Santos et al. 20071 and for coUisionally coupled gas in 



collapsed objects after the completion of reionization (observed in some DLAs 
Curran et al. 2007 1. In this regime the term [{Ts — Tcmb)/7s] in equation ([T]) 



approaches unity and fluctuations in the kinetic and spin temperatures of the gas 
can be ignored resulting in a proportionality between the ionization fraction and 
21 cm intensity. If true, this proportionality greatly simplifies the prediction and 
interpretation of the 21 cm signal. 

1.2 Forthcoming instruments for 21 cm observations 

While 21 cm EoR and post-reionization cosmology instruments are conceptually 
quite similar (see the difference in frequency (redshift) necessitates that the 
EoR and lower redshift cosmology measurements will almost certainly be per- 
formed by separate instruments. There are four EoR arrays planning to start 
sensitive observations within the next year and a half, and there are several 
lower redshift "intensity mapping" machines under active development. The HI 
machines are pushing the state-of-the-art in instrumental and ionospheric calibra- 
tion, large and ultra-large N correlators, widefield observation, and high dynamic 
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range imaging. In addition to opening a new field of cosmology, these telescopes 

are laying a technical foundation for future radio instruments at all frequencies. 
In this section, we briefly introduce the first generation 21 cm fluctuation exper- 
iments and discuss their unique characteristics. 

1.2.1 GMRT The Giant Metre- wave Radio Telescop^ is a general purpose 
low frequency observatory in the Pune region of India, about 120 km east of 
Mumbai. The GMRT has thirty 45 m diameter dishes, the central 14 of which 
are within a 1 km region and are of particular use for EoR power spectrum ob- 



servations (see ^3.2 for a discussion of sensitivity and antenna separations). The 
GMRT epoch of reionization team has upgraded the digital processing system, 
developed a novel pulsar-based calibration approach, and developed advanced 
techniques for identifying and removing both narrowband and broadband radio 



frequency interference (RFI, Pen et al. 2008b). The GMRT team has already 
collected several hundred hours of observations, and additional analysis and ob- 
servations are ongoing. 

1.2.2 PAPER The Precision Array to Probe Epoch of Reionizatioiil^is led 
by researchers at Berkeley, and will consist of 128 antennas located at the Murchi- 
son Radio Observatory site in western Australia. The PAPER antennas feature 
a unique dipole -|- resonator receiving element and focusing ground plane which 
are designed to have a very predictable and stable calibration. Unlike the other 
EoR instruments, the antennas do not track the sky and PAPER operates as 
a drift scanning instrument. Because PAPER is an EoR only observatory, the 
antennas can be moved to maximize the tradeoff between sensitivity and instru- 



ment calibration (^3.2 and ^3.3). Flexibility in design and learning from early 



^http://gmrt.ncra.tifr.res.in/ 

^http: / /astro. berkeley.edu/~dbacker/eor/ 
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observations are distinct strengths of the PAPER experiment. 

1.2.3 LOFAR The LOw Frequency ARrajj^ is currently under construction 
in the Netherlands with antennas extending into several countries of northern Eu- 
rope. LOFAR has been at the vanguard of new low frequency observation, and 
is a multi-purpose observatory with a broad science case including high angular 
resolution, 10-250 MHz frequency coverage, and high speed cosmic ray detection 



(see Rottgering et al. (2006) for overall design and http://www.astron.nl/radio- 
observatory/astronomers/lofar- astronomers for up to date technical information). 
LOFAR's EoR observations will use the high band 110-250 MHz antennas in the 
central two kilometers of the array. Each high band antenna consists of a 4x4 
array of dual polarization dipoles on a ground screen and is embedded in a foam 
environmental protection system. The signals from each dipole can be delayed 
in an analog beamformer to form a ~20-|- degree antenna beam that is electroni- 
cally steered to track objects across the sky. In LOFAR the individual high band 
antennas are further grouped into stations of 24 antennas each to form 'virtual' 
~30 m antennas with multiple beams per station (the number of beams per sta- 
tion depends on available computation, and the more distant stations used for 
non-EoR observations typically have more antennas per station.) The LOFAR 
EoR measurement will use the 36 stations near the core of the array (864 anten- 
nas) and concentrate on 5 observational windows with low sky temperature and 
Galactic polarization. LOFAR has the largest collecting area of the widefield EoR 
instruments (PAPER, LOFAR, MWA) and the highest angular resolution, both 
of which are important for instrumental calibration and foreground subtraction. 



http:/ /www. lofar.org/ 
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1.2 A MWA The Murchison Widefield Arra}f] ( |Lonsdale et~aLl|2009[ ) is cur- 



rently under construction in western Australia at the radio quiet Murchison Radio 
Observatory (MWA and PAPER are walking distance apart). The MWA anten- 
nas are very similar to the LOFAR high band antennas (the telescopes share a 
technical history) , though the MWA antennas are optimized for a slightly higher 
frequency (140 MHz) and correspondingly have a slightly larger ~30° beam. The 
512 antennas of the MWA are arranged in a very compact 1.5 km semi-random 
distribution designed to maximize the quality of the point-spread-function, and 
features full cross-correlation of all antennas (130,000 baselines in 4 polarizations 
and 3000 frequency channels) and realtime holographic calibration. The full cor- 
relation allows the entire antenna field of view to be imaged, enhancing the survey 
speed of the instrument. Of the first generation EoR instruments, the MWA has 
the highest power spectrum sensitivity (by a small margin) and concentrates on 
very low systematics in its calibration and foreground subtraction. 

1.2.5 21 CM INTENSITY MAPPING Over the last couple of years there has 
been significant interest in studying cosmology via 21 cm fluctuations after reion- 
ization with EoR-style instrumentation. 

It has been recognized for some time that the Square Kilometer Array (SKA) 



could perform a billion galaxy spectroscopic redshift survey in HI ( Rawlings et al. 



2004 ) , and that such a survey could have very low systematic errors for cosmology 
(see the report of the Dark Energy Task Forc^. The proposed SKA BAO dark 
energy measurement would identify the three dimensional positions of bright 
galaxies (> L*), and use them as tracers of the underlying density fluctuations. 
The imprint of baryon acoustic oscillations on the galaxy clustering provides a 



^http://www. MWAtelescope.org/ 
®http: //www. nsf.gov/mps/ast/detf.jsp 
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standard ruler, enabling the expansion history of space to be determined out to 

a redshift of ~1.5. In addition to precision dark energy constraints, an SKA HI 

survey would provide a wealth of astrophysical data about galaxy formation and 

evolution. 



Wyithe, Loeb & Geil (20081, Chang et al. (2008) and Peterson, Bandura & Pen 



(20061 realized that compact interferometers with wide fields-of-view might be 
able to perform cosmological measurements, particularly those associated with 
the BAO scale, at a fraction of the SKA's cost. These proposed instruments have 
low angular resolution — they cannot resolve individual galaxies — and have been 
tuned to have the highest sensitivity at BAO angular scales (~150 cMpc). Some- 
what counter-intuitively the low angular resolution enhances the cosmological 
sensitivity of these instruments in two ways. First, by observing the integrated 
21 cm emission in ~150 cMpc regions they observe all of the HI emission from 
the region, not just the emission from the brightest galaxies in that volume. Ef- 
fectively the observation integrates the full luminosity function, and there is a 
lot of HI in DLAs and other small galaxies. Secondly, by moving the antennas 
to short spacings all of the baselines contribute to the measurement. This ob- 
servational technique is commonly called 'redshifted 21 cm intensity mapping' 



(Peterson et al. 20091 in reference to the focus on large scale 21 cm birghtness 
fluctuations, or just 'intensity mapping' for short. 

Current instrument concepts for 21 cm intensity mapping include CHIME 
(Canadian Hydrogen Intensity Mapping Experiment) and the related CRT (Cylin- 
der Radio Telescope), the Omniscope (formerly FFT Telescope), and CARPEQ 
(Cosmological Acceleration and Radio Pulsar Experiment) telescope. Several of 



^http:/ /www. phys. Washington. edu/users/mmorales/carpe/ 
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the arrays are being prototyped, though none have construction funding as of 

publication. 

All of the proposed 21 cm intensity mapping instruments have wide fields-of- 
view to maximize the survey speed (lOO's of square degrees), and thousands of 
receiving elements. The large number of receiving elements presents a significant 
data processing challenge because radio correlators scale with the number of re- 
ceiving elements squared. At 512 elements the MWA correlator is the largest N 
correlator ever constructed, and even Moore's law does not erase the problem 
for arrays contemplating an order-of-magnitude more receiving elements. The 
current instrument concepts are largely driven by data processing considerations, 
using either cylinders (CHIME and CRT), a regular grid of elements and FFT 
algorithms (Omniscope), or novel correlator designs (CARPE). We expect that 
instrumental calibration will become an important design driver as these con- 
cepts mature, and we hope at least one first generation 21 cm intensity mapping 
experiment will be under construction within a few years. 

In this review we have integrated the discussions of 21 cm EoR and post- 
reionization intensity mapping measurements to make the connections between 
the fields explicit. Section [2] reviews the theoretical expectations for 21 cm emis- 
sion with an emphasis on advances made in the past three years. Section [3] then 
concentrates on upcoming observations, with particular attention to the cosmo- 
logical sensitivity of interferometers, instrumental calibration, and foreground 
subtraction. 
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Figure 1: Example models for the reionization of the IGM and the subsequent 
post-overlap evolution of the ionizing radiation field that fit current constraints 
(based on Wyithe Loeb||2008 1 . Upper left panel: The ionization rate as a func- 
tion of redshift. The observational points are from Bolton Sz Haehnelt| (2007b|. 
Upper right panel: The volume (lower curves) and mass (upper curves) aver- 
aged fractions of neutral gas in the universe. The observational points for the 



volume-averaged neutral fraction are from Bolton Sz Haehnelt ( 2007b ) , while the 



observed mass fractions are from the damped Lya measurements of Prochaska 



Herbert-Fort & Wolfe (20051. Lower left panel: The predicted evolution of the 
mean 21 cm brightness temperature (in mK) with redshift. Lower right panel: 
The auto-correlation function (variance) of brightness temperature plotted as a 
function of redshift for scales of relevance for observations using upcoming low- 
frequency radio telescopes (solid and dashed lines assume smoothing scales in 
real space of 20 and 40 comoving Mpc respectively). 
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2 Evolution of the ionization structure in the IGM 



As detailed in § 1.1.1 present day constraints on the reionization of hydrogen 
are hmited to the evolution of the mean ionized fraction (see Figure [T]) . However 
within our modern theoretical framework it has been recognized that reionization 
is not a uniform process. Rather, hydrogen reionization is thought to have started 
with small ionized (HIT) regions around the first galaxies to form, and that these 
later grew to surround groups of galaxies. Reionization is said to have completed 
once HII regions overlap, occupying most of the inter-galactic volume and leav- 
ing only dense islands of neutral gas (as shown in Figure |2]). Thus the process 
of reionization is thought to be inhomogeneous, owing to both fluctuations in 
the density and to fluctuations in the ionization structure of the IGM. The in- 
homogeneities during and after reionization source 21 cm intensity fluctuations, 
facilitating the study of both galaxy formation and cosmology. 

In this section we review current theoretical expectations for the HI signal, 
concentrating on those that can be studied with upcoming observations. Sections 



2.1 -2.6 focus on the signal during reionization, which is complicated by the source 
properties of the first luminous objects, the impact of reionization and heating on 
formation of the subsequent generations of objects, radiative transfer, and other 



effects. Section 2.7 then considers the simpler post-reionization signal which is 



important for cosmological measurements and astrophysics at more recent times. 
2.1 The power spectrum 

The statistics of the 21 cm intensity field resulting from the reionization process 
are often described using the power spectrum of 21 cm fluctuations (hereafter 
referred to as the 21 cm PS). Measurement of this PS is one of the primary 
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goals for low frequency radio interferometers. Before describing the modeling 

efforts that have been undertaken to predict, and eventually interpret 21 cm 

observations, it is therefore useful to first introduce the PS of 21 cm emission 

arising from ionization structure in the IGM. 

The primordial density field as revealed by the CMB may be described as a 
Gaussian random field, in which the distribution of over-densities in the Fourier 
decomposition of the field is Gaussian at all spatial frequencies. This scale de- 
pendent variance is referred to as the PS. In the absence of ionizing sources and 
on large scales, the PS of HI follows the dark matter PS. However once the first 
ionizing sources turn on, the evolution of ionized structure evolves in a complex 
way that reflects the astrophysics of the ionizing sources as well as the cosmology 
of the underlying density field. 

To leading order in 5, it follows from equation ([T| with Tg ^ Tcmb that the 
PS of brightness temperature fluctuations is given by 

Pat = [{xliPss - 2xhi(1 - xm)Psx + (1 - xmfPxx) 

+2K/i2 [x'iiPss - XHi(l - xm)P5x) + {xmPss)] , (2) 

1 

where 7b = 23.8 [(1 -|- 2;)/10] 2 mK, and Pss, Psx and Pxx are the PS of den- 
sity fluctuations, the cross-PS of ionization and density fluctuations, and the 
PS of ionization fluctuations respectively ( |Mao et al. 20081. In equation ^ 



the velocity fluctuation has been written (to lowest order in Fourier space), as 
5v{k) = — K/i^J where ^ is the cosine of the angle between the k and line-of-sight 



unit vectors (Kaiser 19871, ancij k = dlog (^/dlog (1 -|- z). As with traditional 



The quantity k is close to unity at high redshifts, taking values of 0.974, 0.988 and 0.997 at 
z = 2.5, 3.5 and 5.5. 
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galaxy redshift surveys the term Pss encodes the cosmology. However during the 

EoR the terms Psx and P^x are sensitive to the astrophysics of the reionization 

process, and represent the cross-correlation between the clustering of galaxies 

and the resulting ionized structure, and the auto-correlation of ionized regions 

respectively. 

Late in the reionization process the terms Psx and Pxx dominate 21 cm fluc- 
tuations, and so the formation of large HII regions has a significant effect on 
the shape of the 21 cm PS. This effect arises for two reasons. First, information 
about small scale features in the density field is erased from the signal originating 
within the HII regions. In addition, the creation of HII regions imprints large 
scale fluctuations on the distribution of 21 cm intensity. The sum of these effects 
is to move power from small to large scales, leaving a shoulder shaped feature on 
the PS at the characteristic scale of the HII regions that moves to larger scales as 



the bubbles expand during the latter stages of reionization (Furlanetto, Zaldar 



riaga Sz Hernquist|2004b"l ) . This evolution is illustrated by the sequence of power 



spectra shown in Figure [3] Unlike CMB PS measurements, HI EoR measure- 
ments can observe a series of redshifts, creating a coarse power spectrum 'movie' 
of reionization. The dynamics of the PS as reionization progresses provide crucial 



information for constraining models of reionization (Lidz et al. 2008 1 . 



2.2 Modeling the ionization structure to predict the 21 cm signal 

In anticipation of forthcoming 21 cm EoR observations, a great deal of theoretical 
attention has focused on the prospects of measuring the 21 cm PS. This theo- 
retical work can be divided into analytic studies, full numerical simulations, and 
semi- numerical approaches. 
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2.2.1 Analytic models We begin by discussing analytic descriptions of the 

reionization process. A very successful analytic model describing the statistics of 
ionization structure during hydrogen reionization (the bubble model) was intro- 
duced by Furlanetto, Zaldarriaga Sz Hernquist ( 2004b|a |. This model computes 
the mass function of HII regions using an analogy of the excursion set formalism 
for computing the dark matter halo mass function. The crucial difference between 
the bubble model formalism and the standard excursion set formalism is that the 
barrier is a (decreasing) function of HII region size, instead of a constant. This 
feature gives HII regions a characteristic size rather than the power-law behavior 



seen at small masses in the Press Sz Schechter (1974 1 mass function. The charac- 
teristic scale (which is much larger than expected for an individual star forming 



galaxy, Wyithe Sz Loeb 20051 is found to depend primarily on the ionization 
fraction. 

The bubble model is based on one point statistics and naturally produces the 
probability distribution for sizes of HII regions. However in order to compute 
the 21 cm PS the bubble model employs approximate expressions for the spatial 
correlation of ionizations ( Furlanetto, Zaldarriaga Sz Hernquist||2004b ) which are 



grafted externally onto the underlying formalism. To overcome this, Barkana 



(20071 found a solution for 2 correlated random walks, and calculated the 2-point 



correlation function (and hence PS) directly. As discussed in Barkana (20091, the 
analytic and numerical calculations do not agree in detail. On the other hand 
numerical simulations do not yet agree among themselves at a comparable level 



(Iliev et al. 20061. 



Modeling of the 21 cm PS (Figure [3| indicates that the 21 cm PS has features 
that vary non-monotonically, providing opportunities for a range of parameters 
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to be measured. Owing to the computational efficiency of tlie extended bubble 



model, Barkana (20091 argued that observations of the evolving 21 cm PS could 
therefore be used to constrain critical physical parameters in the reionization 



process. To achieve this Barkana (2009) suggested that the analytic model could 
be tuned to provide a sufficiently precise description through comparison with 
numerical simulation (with the caveat that the simulations have converged to a 
reliable prediction). Under this assumption, a fast analytic model could play the 
same role with respect to identifying reionization parameters from observations 
of a 21 cm PS that the code CMBfast does for the CMB. However in addition to 
issues of precision, it should be noted that analytic models for the 21 cm PS do not 
yet include all possibilities, either for the ionizing sources or the contributions 
to the 21 cm signal which could lead to additional systematic uncertainty in 
inferences of source population from the 21 cm observations. We return to discuss 



this point further in § 2.4 



On scales much larger than the bubble size analytic models have also been 
used to investigate the probability distribution for 21 cm brightness tempera- 
ture. Galaxy bias is found to lead to enhanced reionisation in over-dense regions. 
Moreover, because galaxy bias operates most strongly on the exponential tail of 



the Press Schechter (1974| mass function the resulting enhancement is not 
linear with over-density, leading to a non-Gaussian distribution of 21 cm bright- 



ness temperature in intensity maps of reionisation. Wyithe Sz Morales (2007 1 
demonstrate that skewness of the intensity distribution is present on all scales, 
and during the entire reionisation era. This large scale skewness is related to, 
but is different from from the excess power (or shoulder) induced in the PS at 
small scales following the appearance of HII regions ( [Furlanetto, Zaldarriaga~fc 
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. Importantly, neither the skewness nor the variance (calculated 

within real-space volumes) are monotonic with redshift or neutral fraction, even 

in a simple reionisation scenario. 

The sum of these analytic results describes the following fundamental features 

that form the basis of our modern picture for the structure of the reionization 

process: 

• Large-scale, over-dense regions near sources are generally reionized first, 
while the under-dense regions are reionized via the overlap of neighbouring 
HII regions near the end of the reionization process. 

• Sources which are massive tend to be surrounded by an over-density of more 
numerous sources owing to galaxy clustering. These increase the size of HII 
regions beyond what would be expected from the massive source, and can 
dominate the ionizing contribution to some HII regions. 

• Non-Gaussianity is a generic feature of 21 cm fluctuations during the EoR, 
both on scales that are comparable to, and much larger than individual HII 
regions. 

2.2.2 Numerical Models While the analytic studies described above re- 
veal many of the generic features of the reionization process, the inhomogeneous, 
non-linear and non-Gaussian nature of reionization ensures that detailed model- 
ing and understanding of the ionization structure requires numerical simulation. 
There are a range of different techniques applicable. Solving the reionization 
problem requires N-body techniques to calculate the evolution of dark matter, 
hydro-dynamical techniques to model the gaseous component, and an algorithm 
for radiative transfer to solve evolution in ionized structure. The challenge for 
modeling of the reionization process via direct simulation is to resolve all of the 



Hernquist|2004b 
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spatial scales involved. In particular, high resolution is required to resolve small 

scale structure such as galaxies, as well as the Ly-limit systems that regulate 

radiation propagation and so form one of the key factors regulating reionization. 

At the same time, large simulation volumes are necessary in order to contain a 

representative distribution of galaxies and HII regions. Illustrating the difficulty 

of these combined requirements is that both analytic models and simulations 

have shown HII regions to reach characteristic sizes (lO's of Mpc) that contain 

a mass 7 orders of magnitude larger than the halos with a virial temperature of 

T = lO^K that are thought to drive reionization. 

Experience has shown that a realistic study of the large-scale properties of 

reionization requires a minimum box size of around 100 co-moving Mpc on a side. 

This minimum size is set by 2 independent requirements. First, as mentioned 

above the typical HII region size is found to be tens of co-moving Mpc during 



much of the EoR (Furlanetto, Zaldarriaga Sz Hernquist 2004b I . If one wants to 
study the structure and statistics of HII regions during the EoR, the simulation 
must contain many of these HII regions. The second reason concerns cosmic 



variance. As pointed out by Barkana Sz Loeb (2004 1 a simulation which is too 
small does not contain the large scale density fluctuation modes which drive the 
clustering of galaxies on the largest scales. An undersized simulation will predict 
an unphysically rapid reionization, owing to periodic boundary conditions which 
remove the corresponding large scale fluctuations in the radiation field. 

Over this past decade, significant progress has been made in modeling the 



effect of galaxies on the reionization of the IGM. Pioneering studies (e.g. Ciardi. 



Stoehr Sz White 2003, Sokasian et al. 20031 were restricted to small box sizes 



of only ~ 10 — 20 Mpc, with a consequently small number of sources. However 
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the results of these works illustrated that the abundance and luminosities of 

galaxies clearly affect the reionization process. In large modern simulations, the 

most common approach to address this issue is to begin with an N-body code 



to generate a distribution of halos (e.g. Ciardi, Stoehr &; White 2003 Sokasian 



et al. 2003 Ihev et al. 2007 Zahn et al. 2007 Trac k Cen 2007). A simple 



prescription is then used to relate halo mass to ionizing luminosity. Following 
this step, radiative transfer methods (most commonly ray-tracing algorithms) 
are employed to model the generation of ionized structure on large scales. The 
radiative transfer is normally run with lower resolution than the N-body code for 
computational efficiency. 

In the last few years, the critical box size of 100/i~^ Mpc for reionization sim- 



ulations has been reached in this way (e.g. Iliev et al. 2007 Zahn et al. 2007 



McQuinn et al. 2007 1 . More importantly, in just the last year simulations have 



become sufficiently large that they have met both the box size and mass res- 
olution requirements ( [Shin, Trac Sz Cen 2008 Iliev et al. 2008a Trac, Cen & 



Loeb 2008 ) . In Figure [2] we reproduce a sample visualization of the large scale 



simulation presented in |Trac Sz Cen| (2007|. Simulations such as this describe 



Iliev et al. 


2007 


Zahn et al. 


2007 


Mc- 



Quinn et al. 2007, Shin, Trac k Cen 2008, Lee et al. 2008, Croft k Altay 20081, 



confirming expectations from analytic models that large-scale, over-dense regions 
near sources are generally reionized first, and that massive galaxies tend to be 
surrounded by clustered sources that increase the size of HII regions. In addition, 
the simulations describe the structure of the HII regions, showing that they are 
generally aspherical (even where the sources are assumed to emit isotropically) . 
The evolution of ionized structure is affected by the presence of Ly-limit sys- 
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terns which absorb the ionizing photons before they can contribute to ionization 



of the diffuse IGM. This effect has been studied by McQuinn et al. (2007) and 



Croft Sz Altay (2008 1 using a range of simulations, and considering different mod- 



els for both the generation and absorption if ionizing photons. These studies show 
that the size and structure of ionized bubbles at fixed global neutral fraction is 
more strongly influenced by ionizing source properties than by the presence of 
Ly-limit systems, whose impact is to slow the progression of reionization. The 
growth of HII regions during reionisation may also be influenced by radiative 
feedback in the form of suppression of galaxy formation below the cosmological 
Jeans mass within a heated IGM ( Dijkstra et al.|[2004 ), although the importance 



of this effect remains controversial (Mesinger Sz Dijkstra 2008). Suppression of 
low mass galaxy-formation delays and extends the reionization process, which 
though started by low mass galaxies, must then be completed by relatively mas- 



sive galaxies (Iliev et al. 2007 1 . Conversely, ionization also reduces the clumping 



factor of the IGM (Pawlik, Schaye Sz van Scherpenzeel 20091 by increasing the 
pressure support within dense systems. The balance of these effects is critical for 
understanding the detailed process of reionization. 

An important outcome from the large cosmological volumes attained by mod- 
ern numerical simulations has been the prediction of 21 cm signals that will be 
observable using forthcoming low frequency arrays (e.g. [Mellema et al.|2006| |Lidz 



et al.||2008[ ). The most generic features of 21 cm PS modeling were elucidated by 



Lidz et al. (2008 1, who show examples of its evolution. On scales of ~ O.lMpc 



-1 



the terms Psx and Pxx modify both the amplitude and the slope of the PS in a 
non-monotonic way relative to the expected shape in the absence of ionization 
structure (see Figure E|. Thus, measurement of these terms will provide the 
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first clues regarding tlie clustering of ionizing sources during the EoR. Lidz et al. 



(20081 illustrate that the slope and amplitude of the 21 cm PS vary considerably 
among different models at a given ionization fraction. However they also find 
that the behavior with ionization fraction across the different models is relatively 
generic. In particular, the amplitude of the 21 cm PS reaches a maximum close 
to the epoch when ~ 50% of the volume of the IGM is ionized, while its slope 
is found to flatten with increasing ionization fraction as seen in Figures |3] and |4j 



Lidz et al. (20081 argue that a first generation EoR telescope like the MWA has 
sufficient sensitivity to measure the redshift evolution in the slope and amplitude 
of the 21 cm power spectrum. Moreover the characteristic redshift evolution ev- 
ident in models of reionization that are driven by star formation will allow the 
mean ionization fraction near the center of the EoR to be constrained to a level 
of ~ 10%. 

Finally, in addition to the full 3 dimensional structure of ionization during the 
EoR, numerical simulations are able to follow the thermal properties of the IGM. 
The simulations shown in Figure |2] illustrate that the IGM in over-dense regions 
is photo-heated earlier than large-scale, under-dense regions far from sources. 
Thus, conditions in the Lya forest at z < 6 can be used to probe the ionization 
state during the EoR and to tell the difference between different scenarios for the 



reionization history (Trac, Cen Sz Loeb 20081. 



2.2.3 Semi-numerical models In order to overcome the hmitations of 



both analytic and numerical methods Mesinger & Furlanetto (20071 introduced 



an approximate, but efficient method for simulating the reionization process. This 



so-called semi-numerical method extends prior work by Bond Sz Myers (1996) and 



Zahn et al. (20071, and has 2 parts. First, an excursion-set approach (e.g. Lacey 
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&: Cole|1993 1 is applied to identify halos within a realization of the linear density 



field, following which halo locations are adjusted with first-order perturbation 
theory. The method therefore generates halo distributions at a particular red- 
shift, without explicitly including information from higher redshift. Despite this 
simplification, the mass and correlation functions of the resulting halo population 
agree almost perfectly with results from N-body simulation. 



Mesinger Sz Furlanetto (20071 then estimate the ionization field based on a 



catalogue of sources assigned within the halo field (an idea previously used by 
Zahn et al. 2007[ ) by applying a filtering technique based on the analytic bubble 



model. Figure [5] illustrates the structure of the ionization field obtained, includ- 
ing comparison of different variations on the filtering method and full radiative 
transfer. There is good agreement, implying that semi-numerical models can 
be used to explore a larger range of reionization scenarios than is possible with 
current numerical simulations. Importantly, these semi-numerical models retain 
information on the spatial distribution of sources and ionization structure that 
are not available via purely analytic modeling. 



Thomas et al. ( 2009^ introduced a related technique in which a large N-body 



simulation is used as the basis for construction of the ionization field in post- 
processing. In this scheme sources are again assigned to the halos with appropri- 
ate spectral properties. However rather than adopt a filtering scheme, a library 
of pre-computed 1-dimensional radiative transfer results are used to generate HII 
regions based on the spherically averaged radial density profile. A feature of this 
model is that overlap of neighbouring HII regions is treated self consistently with 
respect to photon conservation (photon conservation is not guaranteed within the 
filter based semi-numerical methods described above). 
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Recent work by Choudhury, Haehnelt &: Regan (20091 provides an excellent 



example of the utility of semi-numerical modeling. These authors looked at the 
effect of a position dependent clumping factor on the structure of reionization. 
Their main point of departure from previous work is the enforcement of con- 
sistency with the Lya forest data, which leads to a photon starved reionization 
( Bolton Sz Haehnelt||2007b |. In the photon-starved regime, Choudhury, Haehnelt 



Sz Regan (20091 find that recombinations are much more important than in mod- 
els where reionization occurs quickly, and that accounting for spatially inhomo- 
geneous sinks of ionizing radiation has a large effect on the topology in this case. 
Reionization is found to proceed inside-out during the early stages. However in 
the photon starved regime the sinks of ionizing radiation can remain neutral so 
that reionization proceeds deep into the under-dense regions prior to the comple- 
tion of overlap. 

2.3 Comparison of modeling methods for the EoR 

Given the various complementary methods in use for simulation of the EoR and 
the impending comparison between theory and observation, it is timely to com- 
pare the theoretical situation with the related case of galaxy clustering and large- 
scale-structure. Galaxy redshift surveys have reached a point of observational 
maturity where full N-body simulations of structure formation are required to 
provide a sufficiently precise theoretical model. Given the correct cosmological 
model, N-body simulations are able to reproduce the details of galaxy clustering 
because the source of structure in the density field, namely gravity, is well under- 
stood. However both simulations and observations of the the EoR are currently in 
a different position. Firstly, for the foreseeable future 21 cm based observational 
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constraints on the EoR will be crude by comparison with modern galaxy redshift 

surveys. Moreover, the numerical problem is less precisely posed than a calcu- 
lation of structure formation, because the source of structure during the EoR is 
provided by radiation from galaxies (which act via radiative transfer) , rather than 
by gravity. However in difference to the structure formation case where gravity is 
not generally the topic of study, understanding the galaxies which drive reioniza- 
tion (i.e. the source of structure) represents the primary scientific goal for studies 
of the EoR. Currently, the properties of these early galaxies are very uncertain 
even as regards their qualitative features (e.g. mass, stellar initial mass function, 
contributions from UV photons verses X-rays etc.). Models of the EoR that em- 
ploy different assumptions are therefore able to predict a wide range of scenarios. 
Indeed, the range of predictions for quantities like the 21 cm PS is much larger 
than the uncertainty in calculations of a particular model, and is also larger than 
the difference between different modeling techniques. 

Different methods for performing calculations of the reionization process each 
have their own advantages and disadvantages, and so it is important to compare 



predictions where possible. Analytic models utilize the Press Sz Schechter (1974| 
formalism (with extensions) and operate under the premise that the production of 
ionizing photons is proportional to the formation of collapsed, bound structures 



(Barkana Sz Loeb 2001 1 . It must therefore be recognised that while analytic 
models are based around a linear theory of structure formation, both the collapsed 
structures and the response of the IGM to the ionization field are explicitly non- 
linear. As a result, it is crucial that conclusions reached via analytic studies are 
checked against simulations. Never-the-less, analytic models are able to describe 
the primary features of the reionization process from a fundamental point of view. 
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Moreover, as described above, simulations with sufficient volume to accurately 

represent the process of reionization have only recently been performed due to 

the enormous dynamic range the simulations must address. The large amount 

of computation limits the number of model parameters which can be explored. 

To overcome this limitation it is often advantageous to utilize analytic models, 

which are able to provide a fast exploration of parameter space in many of the 

unknown quantities describing early galaxy formation. 



Santos et al. (2007) have compared the results of analytic models to numerical 
simulations, both in the high redshift regime where the spin temperature is an 
important consideration, and at lower redshifts where they find that it is not 
important (Tg ^ Tcmb)- Moreover at the redshift range that will be probed by 



planned 21 cm experiments (i.e. z < 9), Santos et al. (20071 find that the simple 



analytical models described above coupled with fitting functions to describe the 
ionization fraction PS are able to reproduce the results from numerical simulation. 
The implication is that at these redshifts, there are no impediments regarding the 
use of an analytical (or semi-numerical) model to explore the parameter space 
relevant for future EoR observations (at least when restricted to using estimators 
based on the 21 cm PS alone.) 

2.4 Additional contributions to ionization and the power spec- 
trum 

The modeling of the ionization structure and 21 cm PS discussed thus far has 
centered around the assumption that the ionizing photons are dominated by UV 
flux from star-forming galaxies. In this section we discuss some possible scenarios 
which complicate this picture, and hence threaten to muddy the interpretation 
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of 21 cm observations. We again restrict our attention to the domain late in the 

EoR when the spin temperature of the HI is thought to be much larger than the 



CMB (i.e. z < 9 [Santos etar][2007 |. 

2.4.1 X-RAYS The HIT region dominated ionization structure of the IGM 
seen in numerical simulations (e.g. Figure [2]) forms as a result of the biased 
clustering of sources, combined with the short mean free path (MFP) of UV pho- 
tons. However in addition to UV photons from the first galaxies and quasars, 
it has also been suggested that a background of X-ray photons may have been 
produced at high redshift, primarily from black hole accretion. Unlike UV pho- 
tons. X-rays ionize hydrogen both directly and through secondary ionizations by 
photo-electrons from ionized helium, with the latter dominating so that many 
hydrogen atoms can be ionized by a single X-ray photon. For ionization fractions 



> 10%, ionization by secondary electrons becomes inefficient (Ricotti & Ostriker 



2004) and X-ray ionization is self regulating at the level of ~ 10 — 20%. Obser- 



vational limits on the contribution of X-rays to reionization are provided by the 
unresolved soft X-ray background, which limits the possible level of ionization 



by X-rays from a population of black holes at high redshift (Dijkstra, Haiman Sz 



Loeb 2004). 



Importantly, X-rays also differ from UV photons in their effect on the IGM 
due to their comparatively long MFP, which means that X-rays provide a weakly 
fluctuating radiation background. For this reason, the inclusion of an X-ray 
component of the emission responsible for the reionization of the IGM could 
modify fluctuations in the 21 cm PS by reducing the influence of strongly biased 



ionized regions (Warszawski, Geil & Wyithe 2009). The details of the 21 cm 
PS shape are found to depend on the typical X-ray photon energy, while the 
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magnitude of modification to the PS is determined by the relative contribution 

of X-rays to the EoR. Thus the possible contribution of X-rays has the potential 
to substantially complicate analysis of 21 cm PS observations. 

2.4.2 Quasars Luminous quasars are known to exist at the edge of the 
reionisation epoch ( Fan et al.|2006 1, although the exact role that they play in the 
reionisation process remains uncertain. It is thought that the observed quasar 
population could not have supplied sufficient UV photons to reionize the Universe 



at z > 6 (e.g. Loeb 20091. On the other hand, high-redshift quasars have a 



large clustering bias ( Shen et al. 2007 1 , implying that the effect of their ionising 



contribution on the ionization PS may be significant during the latter part of 



the pre-overlap era. Indeed, Geil &i Wyithe (20091 have shown that with only a 



10% contribution to reionization, the cumulative ionising effect of quasars could 
result in a scale dependent suppression of the 21 cm PS by up to 30 per cent at 
constant global neutral fraction. 



2.4.3 Galactic HI In the modeling described in § 2.2 only the component 



of HI residing in the IGM was considered in forecasting the statistical 21 cm 
signal. However, after the completion of reionisation there is known to be a 
residual HI fraction of a few percent in high density clumps which are believed 
to reside within galaxies (e.g. Prochaska, Herbert- Fort Sz Wolfe|2005 1 , and which 
are identified as DLAs. This high density, galactic contribution to the HI content 
of the Universe must also be present during the reionisation era, and should 



therefore be included in predictions of 21 cm fluctuations. Wyithe et al. (20091 



investigated the impact of HI in galaxies on the statistics of 21 cm fluctuations 
using semi-numerical models. Assuming that 2% of hydrogen is in the form of HI 
and located within galaxies during the reionisation era (approximately the mass 
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weighted fraction in the range 1 < z < 5), a scale dependent reduction of 10-20% 

is found in the amphtude of 21 cm fluctuations. Moreover, because the galactic 

HI is biased towards HII regions, inclusion of galactic HI is found to decrease the 

prominence of the HII region induced shoulder in the 21 cm PS. 



2.5 Other 21 cm signatures of reionization 



Thus far we have concentrated this review on observation of the PS of 21 cm 
fluctuations together with its evolution with redshift. However there are several 
other observational signatures that have been suggested. 
2.5.1 Cross-correlation of galaxies with the 21 cm signal The 
cross- correlation of 21 cm emission with galaxies would directly probe the con- 



nection between reionization and the sources of ionizing radiation (e.g. Wyithe Sz 



Loeb 2007 Furlanetto k Lidz 2007 Lidz et al. 2009). For example, the generic 



model prediction that over-dense regions will be ionized early (see § 2.2) leads to 



an anti-correlation between 21 cm emission and the galaxy population. Wyithe| 



Sz Loeb (2007) argue that the first surveys should be sensitive enough to tell 



the difference between "outside in" verses "inside-out" reionization, even when 
combined with galaxy surveys of only several square degrees. 



McQuinn et al. (2007) have considered the specific case of a cross-correlation 



of Lya emitters with 21 cm emission. Their numerical simulations include the 
important effects of IGM opacity to the Lya line. They find that the large HII re- 
gions formed during the EoR modulate the observed distribution of Lya emitters 



and boost their observed angular clustering. Subsequently, Iliev et al. (2008b) 



have also studied the cross-correlation of Lya emitters with 21 cm emission us- 



ing a large numerical simulation. In difference to McQuinn et al. (2007) these 
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authors find a minimal effect of patcliy reionization on tlie clustering of Lya emit- 
ters, which may be attributed to the large scatter in opacity along different sight 



lines, or to uncertainties in modeling of IGM opacity (Dijkstra, Lidz Sz Wyithe 



20071. 



Furlanetto Sz Lidz (2007) discuss the properties of cross-correlation between 



galaxies and fluctuations in the 21 cm signal more generally, and outline the main 



points of possible advantage relative to a PS analysis (see also Slosar, Cooray &: 
Silk|2007 for a discussion of the cross-correlation between 21 cm emission and the 
IR background or Sunyaev-Zel'dovich effects). They find that the signal-to-noise 
ratio achieved would exceed that of the 21 cm PS by a factor of several, provided 
that a suitable galaxy survey is available over the large field of view (lOO's of 
square degrees). Moreover the cross-correlation would allow more efficient de- 
tection of inhomogeneous reionization. Finally, detection of a cross-correlation 
function would also unambiguously identify the cosmological nature of the 21 cm 
fluctuations. 

2.5.2 Imaging of quasar HI regions The most striking example of a 
cross-correlation between the 21 cm signal and astrophysical sources will be pro- 



vided by the imaging of quasar HII regions. As mentioned in § 1.1.1, several of 
the most distant known quasars exhibit evidence for the presence of an HII region 
in a partially neutral IGM (e.g. Cen Sz Haiman|2000 1 , although the interpretation 



remains uncertain ( Lidz et al.|2007 Bolton Sz Haehnelt|2007a |. A better probe of 



the neutral IGM in the quasar environment during the EoR will be provided by 
redshifted 21 cm emission, particularly since quasar HII regions are predicted to 
be the most prominent individual 21 cm features ( Kohler et al.|2005 ). It has been 



argued that these signatures will be most readily detected a-posteriori, around 
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known high redshift quasars (Geil et al. 2008). Indeed, patchy reionization ren- 



ders bhnd detection of HII regions almost impossible for neutral fractions smaller 



Datta et al. 


2008 


)• 


Valdes et al. 


(2006 


) and 


Geil et al. 


(2008 



have studied the impact of a percolating IGM on the detection of HII regions 
around known quasars, and shown that large (many arcminute) HII regions will 
provide a detectable imprint on the IGM until very late in the reionisation era, 
provided that their location is known a-priori. As an example, slices through a 
model of 21 cm emission from the IGM surrounding a high redshift quasar are 
presented in the upper row of Figure [6} The evolution of the IGM is clearly seen 
in this figure, with the percolation process completing between the "back" of the 
box and the "front" of the box (xa coordinate, corresponding to the line-of-sight 
direction) . 

Quasars have a harder spectrum than stars, and as pointed out by Zaroubi &: 



Silk (20051 and Kramer Sz Haiman (20081, this leads to thicker ionizing fronts 



than is the case for a star burst driven HII region. These authors argue that 
with sufficient angular resolution the contribution of hard ionizing sources such 
as mini quasars to the EoR could be inferred from the structure of ionizing fronts 
at the edge of HII regions. Moreover, quasar HII regions will last for a long time 
following the epoch when the quasar turns off. Fossils of quasar HII regions in an 
inhomogeneous IGM at z < 10 would maintain an ionization state greater than 



for around a Hubble time following the death of their quasar ( Furlanetto,| 
Haiman Ohl[2008l ). While imaging of the resultant networks of HII regions (as 
opposed to a single quasar HII region) is beyond the reach of the first generation 
EoR observatories ( ^3.2[ ), this will be a primary goal of second generation of EoR 
telescopes. 
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2.5.3 PDF OF 21 CM INTENSITY FLUCTUATIONS The distribution of fluc- 
tuation amplitudes is expected to be Gaussian early in the EoR. At this time 
the PS is the natural statistic to describe fluctuations in 21 cm emission, since 
it contains all the statistical information. However the distribution of fluctu- 



ation amplitudes becomes non-Gaussian as reionization progresses (Wyithe Sz 



Morales 2007, Harker et al. 2009), particularly on small scales when HIT regions 



are formed ( Furlanetto, Zaldarriaga Hernquist||2004a Bharadwaj Ali||2005 | 



As a result, the PS does not provide a complete statistical description of the 
reionization process. Some additional information could be captured by skew- 
ness in the probability distribution ( Wyithe &: Morales||2007 Harker et al. [2009 1. 



Estimates in these papers indicate that MWA and LOFAR will have sufficient 
sensitivity to detect the evolution in the skewness, though foreground subtraction 
has yet to be considered in depth. 



Ichikawa et al. (20091 extended these ideas by calculating the evolution of the 
probability distribution of intensity fluctuations using a numerical simulation of 
cosmic reionization. The intensity distribution has a Gaussian shape at high 
redshift, and then develops an exponential tail at low intensity as HII regions 
form. Towards the end of reionization as HII regions dominate the volume, the 
distribution becomes strongly peaked at a neutral fraction of zero. 

2.6 Cosmology with EoR observations 

While the prime motivation for EoR experiments has been measurement of the 
PS to probe the astrophysics of the reionizing sources, a number of authors have 
explored the possibility of measuring cosmological parameters from the shape of 
the underlying matter PS. 
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Bowman, Morales &: Hewitt (20071 considered the possible constraints on cos- 



mological parameters that would be available from measurements of the spheri- 
cally averaged 21 cm PS. They found that even in a scenario where the PS was 
dominated by density fluctuations (i.e. before reionization is underway), planned 
telescopes like the MWA would not be able to make interesting cosmological 
measurements. Similarly pessimistic results were presented by Santos &: Cooray] 



(20061 who found significant degeneracies between astrophysical and cosmologi- 
cal parameters in the angular PS. Moreover, during the the EoR, the PS of 21 cm 
brightness fluctuations is shaped mainly by the topology of ionized regions, rather 
than by the PS of matter density fluctuations which is the quantity of cosmolog- 



ical interest ( [McQuinn et al.||2006[ [Santos et al.||2007[ [Iliev et al.||2007[ ). With 
respect to measurement of the mass PS, the appearance of HII regions compli- 
cates the interpretation of a measured PS by generating scale dependence in the 
effective bias that relates the 21 cm PS to the linear mass PS. 

On the other hand, peculiar velocities of the 21 cm emitting gas are sensitive 
only to the mass density and not to the ionization structure. As a result, the 
line-of-sight anisotropy of the 21 cm PS due to peculiar velocities may be used 
to separate measurements of the density PS from the unknown details of the 
astrophysics driving the growth of HII regions dBarkanafc Loeb|2005p . Although 
the dependence of the PS on angle is nontrivial (see equation [2]) , the cosmological 
signal could be extracted by isolating the term proportional to /x^. However 



McQuinn et al. (20061 found that planned low frequency arrays like MWA and 



LOFAR will not be sensitive to the /i^ term, and so could only provide useful 
cosmological constraints in a situation where density fluctuations dominate the 
21 cm PS. 
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More recently, Mao et al. (2008) have shown that parametrization of the ioniza- 
tion PS could be used to model the full 21 cm PS in order to extract cosmological 
constraints. The required parametrization is motivated by the generic form ex- 
pected for the ionisation PS, and in practice adds several free parameters to the 
cosmological fit. Although the method relies heavily on the applicability of the 
reionization model, these authors show that marginalizing over the additional 
parameters is more effective than using only the //^ term in the 21 cm PS (equa- 



tion |2|. Indeed, Mao et al. (2008) show that parameterization of the ionizations 
structure may provide constraints on the cosmology that are almost as tight as a 
case where 21 cm tomography measures the matter PS directly. However cosmo- 
logical constraints that are competitive with existing measurements (from CMB 
etc.) will still require larger arrays than those currently planned. 

2.7 The 21 cm fluctuations from the post reionization IGM 

Until recently, the conventional wisdom has been that the 21 cm signal disap- 
pears after the end of the EoR, because there is little HI left through most of 
intergalactic space. However the redshifted 21 cm emission from a volume of 
IGM is sensitive to xhi, which is non zero at all cosmic epochs (e.g. Figure [T|. 
The following simple estimate can be used to demonstrate that 21 cm emission 
should be significant even after the EoR is completed. 

Observations of DLAs out to a redshift of z ~ 4 show the cosmological den- 



sity parameter of HI to be JIhi ~ 10 ^ (Prochaska, Herbert- Fort Sz Wolfe 2005) 



This corresponds to a mass-averaged HI fraction of xhi ~ 0.03, and a brightness 
temperature contrast of redshifted 21 cm emission of AT ~ 0.5mK (see equa- 
tion [T]). Moreover on ~ 10 co-moving Mpc scales the typical amplitude of density 
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fluctuations at z ~ 4 is o" ~ 0.2. Hence, we expect fluctuations in the radiation 

field of ~ O.lmK after reionization on scales relevant for upcoming 21 cm experi- 
ments. As seen in Figure [T| these fluctuations are only an order of magnitude or 
so smaller than the largest fluctuations predicted at any time during the entire 
EoR. At the same time the sky brightness (which limits the observational sensi- 
tivity, see 



3.4.1) will be ~ 5 times smaller owing to the frequency dependence 



of the foreground synchrotron emission. Thus, detectability of fluctuations in 
21 cm emission may not decline substantially following the overlap epoch. The 
post-reionization HI could therefore be studied via 21 cm intensity mapping using 



telescopes like those described in § 1.2.5 



The detectability of the 21 cm PS after reionization was discussed by |Khandai. 



Datta Sz Bagla (20091. These authors used a numerical simulation to predict 



the statistical signal of 21 cm fluctuations in the post-reionization IGM, and 



estimated its detectability by telescopes like the MWA and GMRT. Khandai 



Datta Sz Bagla (2009) find that a combination of these arrays offer good prospects 



for detecting the 21 cm PS over a range of redshifts in the post reionization 
era. Importantly, a statistical detection of 21 cm fluctuations due to discrete, 
unresolved clumps of neutral gas has already been made ( Pen et al.|2009 ) through 



cross-correlation of the HIPASS (Barnes et al. 2001) 21 cm observations of the 



local universe with galaxies in the 6 degree field galaxy redshift survey (Jones 



et al. 2005). This detection represents an important step towards using 21 cm 



surface brightness fluctuations to probe the neutral gas distribution in the IGM 
(both during and after reionization). 

2.7.1 The 21 cm power spectrum after reionization The form of 



the post-reionization PS can be derived in analogy with the PS during the EoR 
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(Wyithe Sz Loeb 2009). To begin, the fluctuation in ionization fraction may be 
written as 

6. ^ [xm/{xm - 1)] [^(1 - K{k)) - 1] 6. (3) 

wliere b is tlie clustering bias of the HI relative to the underlying density field. 
The function K is introduced to quantify the effect that the ionizing background 
should have on the neutral fraction. Fluctuations in the HI content of individual 
DLAs can be shown to be proportional to fluctuations in the ionizing background 
(which has a small effect on the density at which the DL A becomes self shielded) . 
By convolving the real space density field with a filter function it is then possible 
to calculate the function K, which becomes 



Kik) = ^ /^-^tan(fcA^fp) ^ 

"''^mfp 



where k = \k\, and Amfp is the ionizing photon mean- free-path. In Wyithe &: 



Loeb] ( [20091 ) it was shown that for realistic DLA profiles the constant Kq has a 



value smaller than Kq ~ 0.01. 

From equation ([s]) we see that the fluctuation in ionization is proportional to 
the fluctuation in density up to a weak (and simple) scale dependence, yielding 
an expression for the direction dependent post-reionization 21 cm PS 

PATib) = r^xl^ [b(l - K{k)) + Kf?]^Ps5, (5) 



where terms are defined as per equation (|2]). Thus the 21 cm PS after reionization 
has a form that is very similar to the familiar galaxy PS in the regime of scale 
independent halo bias. 
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2.7.2 Cosmology with 21 cm fluctuations after the EoR The sim- 
plicity of equation ^ allows near direct observation of the mass-PS, and hence 
the full 21 cm PS to be utilized in deriving cosmological constraints. This com- 
pares favorably with the EoR, during which either the fi^ term alone can be 
utilized ( |Barkana Loeb||2005p , or a functional form for the ionization PS must 
be assumed ( Mao et aL]|2008 1. The resulting utility of the 21 cm PS from HI in 
the post-reionization Universe has been pointed out by a number of authors. 
For example, measurement of the scale of baryonic acoustic oscillations using 



21 cm fluctuations in the post-reionization Universe was discussed by Wyithe, 



Loeb Sz Geil ( 2008 ) who focused on redshifts z > 3 where telescopes of the design 



of MWA could operate, and by Chang et al. (2008) who looked at measurement 
of the oscillations in the redshift range z < 2 where measurements best constrain 



conventional dark energy. Chang et al. (20081 argued that a radio telescope with 
a diameter of ~ 200 wavelengths would provide an efficient method for surveying 
huge volumes, and so provide constraints on dark energy parameters that would 
be competitive with planned galaxy redshift surveys. 



The studies of jLoeb k Wyithe] ( p008| ) and [Visbal, Loeb k Wyithe] ( p008| ) fo- 
cused on cosmological constraints from studies of the full PS using futuristic 



arrays beyond the first generation. Loeb k Wyithe (20081 noted that a ded- 
icated HI cosmology observatory could probe a number of independent modes 
two orders of magnitude larger than currently available, and that this would en- 
able a cosmic-variance limited detection of the signature of a neutrino mass as 



small as 0.05 eV. Visbal, Loeb k Wyithe (2008) provide a detailed forecast of the 
constraints on cosmological parameters that would be achievable, and find that 
HI cosmology telescopes dedicated to post-reionization redshifts may yield signif- 



Cosmic 21 cm Fluctuations 41 
icantly better constraints than next generation Cosmic Microwave Background 



(CMB) experiments. In a complementary study, Bharadwaj, Sethi &; Saini (20091 
focused on the possibihty of using 21 cm fluctuations to determine the redshift- 
space distortion parameter, as well line of sight verses transverse co-ordinate 
distances. Using the Alcock-Paczynski effect combined with sensitivity estimates 
corresponding to forthcoming low- frequency arrays, these authors predict con- 
straints on dark energy parameters that would be at a precision comparable to 
state of the art type-la supernova observations and galaxy redshift surveys, but 
across a wide range in redshift. 

2.7.3 Galaxy evolution with 21 cm fluctuations after the EoR 
In addition to cosmology, intensity mapping instruments will also provide an 
important new probe for studying the global properties of the galaxy population. 
For example, Wyithe| ( |2008 ) shows that measurement of the PS of 21 cm intensity 



fluctuations (on linear scales) could be used to measure both the galaxy bias for 
DLA host galaxies, and the cosmological HI mass density. On smaller scales, 
simulations suggest that the HI density field is highly biased so that individual 
rare peaks might be detected, facilitating a direct estimate of the cosmic HI mass 



density ( Bagla Sz Khandai 2009 1 . The evolution of the cosmic HI mass density 
and its environmental dependence will provide crucial evidence in our developing 
understanding of the role of neutral gas in galaxy formation. 
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Figure 2: Prom Trac &: Cen ( |2007| ). Image showing the HII density (brighter 
indicates higher density of ionized hydrogen) in 1 Mpc deep shoes through 
a simulation box (50/i^^ Mpc on a side) at redshifts z = 9 (top left), 8 (top 
right), 7 (bottom left), and 6 (bottom right). The volume- weighted mean H I 
fractions are 

{Fni)v = 0.67, 0.46, 0.12, and 1.5 x 10"^, respectively. At high 
redshifts the neutral IGM appears as large dark regions with bubbles of neutral 
hydrogen around the first sources. These bubbles eventually percolate, reionizing 
the universe by the last panel. In the last panel the cosmic web can be seen 
as a tracer of higher density gas. Neutral hydrogen survives in dense galactic 
systems, enabling the detection of HI fluctuations after reionization. Note that 
the image is plotted in terms of HII density which accentuates the appearance of 
HII regions since the fluctuations of HI in the neutral IGM (which are in fact the 
21 cm observable) are not displayed. 
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Figure 3: From Barkana (2009). Evolution of dimensionless 21 cm PS A21 = 
^yWP{k)J{27^^ throughout the EoR, for a model that sets Xi = 98% at z = 6.5 
with a minimum circular velocity for ionizing sources of 14 = 35 km/s. The 
models shown have Xi = 10%, 30%, 50%, 70%, 90%, and 98% (from top to 
bottom at large k) and show the characteristic PS shoulder from HII regions 
moving from small to large scales as reionization progresses. 
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Figure 4: From Lidz et aL] (|2008 |. Amplitude and slope of model 21 cm power 
spectra A^^ ^ k'^P{k)/{2iT^) as a function of ionization fraction (in units of 
(28[(l+z)/10]mK)2. Bottom: Amplitude of the 21 cm PS, at the pivot wavenum- 
ber (A; = 0.4h Mpc~^) for MWA observations in a fiducial (solid red line), rare 
source (blue short-dashed line), and mini-halo models (black long-dashed line), 
plotted as a function of ionization fraction (see Lidz et al. [2008 for further de- 
tails). For comparison, the green dot-dashed line shows the amplitude of the den- 
sity power spectrum obtained by mapping redshift to ionization fraction. Top: 
Slope of the 21 cm PS at the pivot wavenumber for our three models, as well as 
the density PS slope. 
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Figure 5: From [Mesinger &: Furlanetto (2007 1 . Slices from the ionization field 
at z = 6.89 created using different algorithms. All slices are 93.7 Mpc on a side 
and 0.37 Mpc deep, with the mean neutral fraction in the box being xhi = 0.49. 
Ionized regions are shown as white. The left most panel was created by performing 



the bubble-filtering procedure of Zahn et al 
field 



(20071 directly on a linear density 



The second panel was created by performing the bubble-filtering procedure 

The third panel was created 



of Zahn et al. (2007) on their N-body halo field. 



by performing the bubble-filtering procedure of Mesinger Sz Furlanetto (20071 on 
the same N-body halo field. The rightmost panel (from Zahn et al. 20071 was 
created using a ray tracing algorithm on the same halo field. 
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Figure 6: From Geil et al. (20081. A quasar HII region in an evolving IGM. Three 



aspects of the HII region are shown, with shces through the center of the box 
when viewed from the front, top and side. The quasar was assumed to contribute 
ionization equivalent to an HII region of radius iiq,tot = 34 co-moving Mpc, and to 
be centered on z = 6.65, which is also the redshift at the center of the simulation 
box. Each slice is 6 Mpc thick, which corresponds to ~ 3 MHz along the xs-axis. 
In observed units, the cube is ~ 3.3 degrees on a side and 33 MHz deep. The 
model, foreground plus model, and observed maps following foreground removal 
are shown in the upper, central and lower panels respectively. The shape of the 



HII region extracted from the foreground removed cubes is also plotted (see Geil 
et al. 2008 for details). The mass-averaged IGM neutral fraction was assumed 
to be xhi = 0.15. 
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3 Observations 

The last few years have seen major advances in instrumentation for observing HI 
fluctuations. The major existing low frequency observatory has been significantly 
upgraded to search for the EoR power spectrum, and there are three new EoR 
observatories currently under construction. In addition, a number of HI intensity 
mapping machines are in the design and prototype phases, and we expect at 
least one of these to start construction in the next few years. These HI machines 
are pushing the state-of-the-art in widefield observations, digital data handling 
and correlation, precision instrumental and atmospheric calibration, and high 
fidelity foreground subtraction. In this section we review the current state of 
the field with an emphasis on the astrophysical and observational effects that 
drive instrument design, and the solutions different groups have developed. We 
start with a brief introduction to interferometry ( ^3.1| ) and the sensitivity of 
interferometers to the HI power spectrum (^3.2[). We then discuss instrumental 



calibration in §3.3|and end with foreground subtraction in ^3.4 



3.1 Introduction to interferometric measurements 

All of the proposed HI measurements of the EoR and the dark energy equation 
of state utilize radio interferometers, and interferometers are best understood in 
the Fourier domain. In an effort to build a conceptual understanding of interfer- 
ometers, it is instructive to start with the electric field as seen by the array and 
build up to the final brightness measurement. We will limit our discussion to 
narrow fields-of-view and antennas in a plane to focus on the key relationships, 
and the text explanation is paralleled by a pictorial description in Figure [7} The 
interested reader is encourag ed to look at [Taylor, Carilli Perley] ( |1999[ ), |Rau 



148" 
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et al. (20091 and Carozzi &; Woan (2009) for complete mathematical descriptions 



of interferometry, including curved sky, widefield, and non-planar array effects. 

The direction dependent electric field at a reference position E{9, t) can be 
integrated to form the total electric field at the reference position E{t) (small 
FoV limit), or propagated to nearby locations (in the far-field limit) using 



E{r,e,t) = E{e,t)e 



2m e-r/A 



(6) 



Integrating over the sky to determine the total electric field at each location, we 
obtain the fundamental relationship of interferometry: 

E{v,t) = j E{e,t)e^''' ^■'■/^ d^e. (7) 

The electric field as a function of position on the ground is the Fourier transform 
of the electric field coming from the sky (see panels a and b of Figure [7]) . This 
happy coincidence between the electric field propagator and the Fourier transform 
forms the basis of interferometry. 

While radio antennas directly measure the electric field, what we are really 
interested in is the sky brightness I{0). Even over very narrow ranges of frequency, 
the phase of the incident electric field will change randomly (not a delta- function 
in frequency space). However, Equation [6] still holds and the electric field at 
two locations will be delayed by a constant amount. The sky brightness is the 
time-average square of the electric field: 



m = {E{e,t)E*{e,t)), = {\E{9,t)\')^ 



(8) 
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Using the relation that a multiphcation in one Fourier space is a convolution in the 

dual space {HG = h*g), we can express the sky brightness in Fourier coordinates 

either by taking the auto-convolution of E{r) in analogy with equation |8] or by 

directly taking the Fourier transform of I {9). This gives us 

I{u) = {E{r,t)*E*{r',t))^, (9) 

where u = Ar and this is shown pictorially in panels g and h of Figure [7j 

The middle row of Figure [7] describes the measurement process. We can de- 
scribe our array as the locations on the ground r where the electric field is 
collected — for an array of dishes this looks like a series of small circles at the 
location of each antenna as in panel d. The inverse Fourier transform of the an- 
tenna distribution A{r) gives us the electric field point spread response A{6), the 
square of which gives us the usual brightness point spread function B{0). The 
point spread function (PSF) is equivalent to the power response the array would 
measure for a single point source. We can form the last panel in the row (j) 
either by Fourier transforming the point spread function, or more conceptually 
by taking the auto-convolution of the antenna distribution (panel d) . Due to the 
auto-convolution there is a brightness measurement for every pair of antennas 
{N(N — l)/2 pairs). Furthermore the size of each measurement region is ~twice 
the diameter of the antennas shown in ^(r) (for clear apertures A(r) is a circu- 
lar tophat at each antenna, and B(u) is a circular cone at the location of every 
antenna pair /baseline) . 

The last row of Figure [7] then represents the observations. In the Fourier space 
representations the array samples a subset of the incident electric field/brightness 
as shown in panels / and I. Since this sampling is a multiplication in the Fourier 
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domain, it is equivalent to convolving the sky (panels a and g) with the electric 

field or brightness point spread functions (panels c and i). 

While most astronomers are comfortable with convolving the input sky with 

a point spread function, working in the Fourier space is less familiar. However, 

an interferometer works by measuring the electric field at each antenna, which is 

then digitized and converted into cross-power visibilities in the correlator. Each 

of the visibilities corresponds to the brightness measurement 



Im{n) = {Em{r,t)*E*^{r,t)), (10) 

at one of the antenna pairs, as shown in panel I of Figure [7j Each of the visibilities 
measured by the array is equivalent to measuring the underlying Fourier mode 
u. An instrument designer has a lot of flexibility with an interferometer, because 
they can tune the Fourier modes that are measured by changing the arrangement 
and separation of the antennas. For cosmology, this best examples of this are the 



DASI (Leitch et al. 20021 and CBI experiments where the designers maximized 
their signal-to-noise by choosing antenna distributions that only measured the 
angular Fourier modes of interest. In the next section we will see how changing 
the antenna configuration affects the sensitivity of arrays to cosmological HI 
signals. 

In practice the optics of the antenna sum the measured electric field Em{t,r) 
and squeeze it into a single RF cable E'-{t) (integral over the disk of an antenna 
in panel d). The electric field is split into many narrow frequency channels in 
the first stage of the correlator, then the cross-power visibilities between antenna 



pairs are calculated using 



Vij 



[E[{t)E'*{t))^ (11) 
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in each narrow frequency channel]^ The measured visibihty of a pair of antennas 

Vij is equivalent to the average over position (u) and the frequency channel (A/) 



A/A, 



A/ 



J I{u)Bij{u) d^u df 



:i2) 



for the corresponding baseline in panel I, and we have used the relation J f{x) * 
g{x)dx = (J f{x)dx) (/ g{x)dx) to convert the integral over {Em{Tc, t) * E^{r, t))^ 



in Equation 10 into the multiplication in Equation 11 The visibility cross-power 
has units of mK or Jy, depending on the units chosen to express the sky brightness 
I{0), and the uncertainty in the measurement is given by 



^mis(u) 



sys 



(13) 



where r is the integration time for that spatial mode, Tgys is the system temper- 
ature (equivalent power of a resistor on the cable at temperature Tgys), Ag is the 
effective area of an antenna, and A is the observing frequency. 



10 



The system temperature is the sum of the sky brightness — primarily galactic 
synchrotron at EoR and post-reionization frequencies — and the analog receiver 
noise (Tsys = Tgky -|- Tree)- The intensity of the synchrotron emission scales with 
frequency as approximately /~^'^, and varies by orders of magnitude from the 



central regions of the galaxy to high latitudes (see Figure M from de Oliveira- 



Costa et al. 2008 1 . While good uncooled low noise amplifiers can have noise 



temperatures as low as 15 K, the full receiver noise typically ranges from 40 K 



®The procedure described is the FX correlator algorithm (frequency then multiply), and is 
used for all current EoR instruments. There are alternate but equivalent algorithms such as the 
XF algorithm used in the VLA correlator. 

^^Tsys is the dual polarization intensity system temperature i n th is c onte xt. If working in 



Jy units multiply by 2fcB/A^. The the constants in Equations 12 and 13 implicitly assume 
/ B{e) = Ae |Rohlfs fc Wilson|2000[ [Morales fc Hewitt|2004| . 
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to 100 K depending on the impedance match of the antenna and how much of 

the thermal glow from the ground leaks in. For EoR experiments it is common 

to have fairly high receiver temperatures to keep the antenna costs down, as the 

system temperature is dominated by the several hundred degree sky contribution, 

whereas intensity mapping instruments operating at lower redshift must pay much 

closer attention to the analog systems as the sky temperature is only 5-10 K. 

The last step in accurately describing the sensitivity of an interferometer is 

to include sky rotation. While a single baseline measures one Fourier mode u 

instantaneously, the apparent location of the baseline moves as the source tracks 

overhead (in Figure [T] panel / the baseline locations change). Conceptually this 

is easiest to see by envisioning the array painted on the surface of the earth, 

as viewed by the celestial source. From the perspective of the celestial source 

the orientation of the array appears to rotate and foreshorten as the Earth spins 

(if neither the array or celestial source are at the equators). Over the course 

of a multi-hour observation the movement of the baselines through u means 

that a many more Fourier modes are observed. As the sampling of Fourier plane 

improves the quality of the recovered image increases dramatically, an effect called 

'rotation synthesis.' 

However, the uncertainty on a single Fourier mode is related to how long that 
particular mode was observed, not the total observation time. To determine the 
signal-to-noise per mode, we can sample the u plane on a regular grid and add 
the measured correlations to the appropriate sample location (this is done with a 
convolving kernel and is referred to as 'gridding' in the interferometry literature). 
The noise for a single sample v{u) is the total weighted integration time — how 
long that spatial mode was observed. Alternatively we can pixelate the u plane on 
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the scale of the instrumental beam B{u) and count the amount of time baselines 

spent within that u pixel. 

There are a number of important effects which have been ignored in this simpli- 
fied introduction to interferometric observations, including multi-frequency syn- 
thesis, non-coplanar antenna locations (3D u), and curved sky effects. The in- 
terested reader is referred to the review by |Rau et al. (2009), the VLA summer 



school proceedings by Taylor, Carilli Sz Perley (1999), and the papers by Bhat 



nagar et al. (2008), Cornwell, Golap & Bhatnagar (2008) and Morales & Matejek 



(2009). 



3.2 Sensitivity of an HI interferometer 

The HI signals from both the Epoch of Reionization and lower redshifts are in- 
herently three dimensional, with the observed frequency directly providing the 
emission redshift. The emission from the cosmological volume {{x, y, z} in cMpc) 
is mapped by the redshift and angular diameter distance into angular position 



and frequency {9x,Gy,f} (Hogg 1999). However, as discussed in the previous 



section interferometric measurements are performed in Fourier space. A typ- 
ical radio interferometer thus measures the Fourier modes {ua;,Uy} in several 
thousand frequency channels (A/ w 10 kHz), and can be pictured as simultane- 
ously measuring panel I of Figure [7] in several thousand closely spaced frequency 
planes. In many ways, HI measurements are very strange as the {ux,Uy,f} 
measurements can be interpreted as cosmological wavenumbers in the angular 
direction (inverse distance) and line-of-sight distance in the frequency direction 
{kx,ky,z} with units of {cMpc^"*^, cMpc^"*^, cMpc}. The visibility measurements 
can either be Fourier transformed in the angular directions to form an image 
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volume (with PSF artifacts), or can be Fourier transformed in the hne-of-sight 

(frequency) direction to form a three dimensional wavenumber cube {kx,ky,kz} 

dMorales Hewitt ||2004l ). 



The three dimensional A;-space is particularly useful for power spectrum mea- 
surements. In the absence of cosmic evolution and velocity distortions, the 
power spectrum signal is spherically symmetric in the three dimensional /c-space 
{kx,ky,kz}. This approximate spherical symmetry is due to the isotropy (rota- 
tional invariance) of space, and its importance for HI cosmology measurements 
is difficult to overstate. Measurements of the one dimensional power spectrum 
are performed by measuring the variance (average of squares) of all the measure- 
ments within a spherical shell in A;-space (|A;| it AA;/2); astrophysical foregrounds 



have a different shape in /c-space which enables their removal (^3.4.1 and ^3.4.2); 



and the density of measurements in A;-space drives the design of HI cosmology 
arrays (discussed later this section). This is the same rotational symmetry which 
leads to averaging over angular m modes in CMB measurements, just translated 
from the two-dimensional surface of the CMB to the three dimensional volume 
of HI cosmology measurements. 

Of course the spherical symmetry is only approximate — the universe evolves 



very quickly during the EoR, there is large scale gravitational infall (Barkana Sz 
Loeb|2005p , and non-linear velocity effects ('fingers of God') become significant at 
the lower redshifts targeted by HI intensity mappers. For HI intensity mapping 
this is an advantage. Because the cosmological models predict the apparent 
angular diameter, the relative line-of-site distance (Az), and the velocity effects, 
these /c-space asymmetries can be used to tighten the cosmological constraints 



(Visbal, Loeb Sz Wyithe 2008). Unfortunately the theoretical EoR models are 
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significantly more complex, and the evolution of the ionized bubbles and the 

parameters which control this evolution are much less certain (see This 

uncertainty leaves the EoR observer without a definite model to test — there are 

no widely agreed upon parameters which we can use in a Fisher matrix analysis. 

To date the response has been to take a narrow range of redshift Az ~ 0.5 

(about 8 MHz), and assume that cosmic evolution is relatively minor over this 

interval. Isotropy is then used to generate a spherically averaged one dimensional 

power spectrum within each Az ~ 0.5, resulting in a coarse power spectrum 

'movie' showing the dynamics of reionization and how the bubbles expand and 

overlap (see Figures [s] and |4]) . The advantage of this approach is that it is largely 

model independent, and allows the parameters of a wide variety of theories to be 

constrained. Once the theory matures, the EoR will become more like the CMB 

with specific parameters that we can constrain. 

For power spectrum measurements, the signal P{k) is independent for each 

Fourier mode in the three dimensional /c-spacej^ This statistical independence 

makes /c-space the natural frame for calculating the sensitivity of an interferom- 



eter. (Here we are closely following the papers by Bowman, Morales Sz Hewitt 



(2006) and McQuinn et al. (2007|.) In the natural {u^,-, Uj^, /} space of an inter- 
ferometric measurement, the uncertainty within a given 'pixel' of u or k± space is 



given by Equation 13 where the size of the pixel is given by the width of the an- 
tenna X 2 {B{u) = A{r) *A*{r)), the depth of the pixel is given by the frequency 
resolution (A/ or dz), and the integration time is given by the total amount of 
time the various antenna baselines spend within that pixel of Fourier space. We 

can transform this {ux,Uy,f} cube along the frequency /redshift dimension and 

^^There are non-Gaussian and non-stationary statistical correlations during the EoR imprinted 
by the bubbles, but no one has figured out how to measure these statistical properties/topology 
with first generation arrays. 
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map to cosmological coordinates to obtain a measurement in {kx,ky,kz}- The 

uncertainty per 3D cell in A;-space is 



^rn.s(k) = (14) 



where B is the bandwidth of the measurement (limited at EoR frequencies by 
cosmic evolution to ~8 MHz), and r is the total integration time by all baselines 
for the cell. 

For a power spectrum signal, what is defined is the variance of the signal 
P{k). In practice the measurement I{k) is divided into two separate integrations 
of equal length, which are multiplied together to form |/(A;)p. (Splitting the 
measurement eliminates the thermal noise power contribution, as the noise is 
independent for the two integrations but the signal is the same, M. Tegmark 
private communication.) The expected signal in each /c-space cell is given by 

{\m\^) = j p{k)\B{k)fd'k, (15) 



where we have mapped the beam B{u, f) into ^-coordinates, The thermal 
uncertainty per /c-space pixel is exponentially distributed (due to squaring the 
measurements) and has an rms of 



15 



as the Fourier 



'^^Be very careful when inserting a predicted power spectrum into Equation 
transform convention must be the same for all terms. There are three Fourier transform con- 
ventions in common use, each with different normalizations (27r's), and this has led to numerous 
mistakes. 
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In addition there is a sample variance contribution to the uncertainty 



[C'\^s = Pik)'^S^r (17) 



e 



For intensity mapping measurements the expected signal per /c-cell (Equation 



15) and the uncertainties (Equation 16 and 17) can be used in a Fisher matrix 



analysis to directly determine the uncertainty on the cosmological parameters 
(such as those for the dark energy equation of state). For EoR analyses, one 
typically assumes isotropy and forms a variance-weighted average of all the k- 
cells within a spherical annulus of \k\ zt Ak/2. The uncertainty on the resulting 
one dimensional power spectrum bin is the weighted average of the thermal and 



sample variance terms (Equations 16 and 17 ), and is proportional to 1/ \/Nk where 



Nk is the number of A;-cells averaged over. For one dimensional power spectrum, 
it is convenient to plot the measured values in terms of the input power spectrum 
by dividing both the measured signal and the uncertainty by J \B{k)\'^ d?k (from 



Equation 15) as in Figure [9| where examples are shown both for the MWA at 
z > 6 and for the CARPE at z = 1 - 2. 

It is instructive to look at how these relationships affect array design. Each an- 
tenna pair, over a short observation time, measures a particular angular Fourier 
mode at all the observed frequency channels. When we move from the {u^^, u^^, /} 
space of the original measurements to the three dimensional /c-space of the cosmo- 
logical power spectrum {k^, ky, kz}, each antenna baseline measures one angular 
k± mode but all of the line-of-sight k^^ modes. A single baseline can be envisioned 
as measuring all of the /c-space cells along pencil beam parallel to the line-of-sight. 



as shown in Figure 10 This means that short baselines (small k±) still measure 



very small spatial scales (large k) along the line-of-sight A;|| direction. Short base- 
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lines measure all k modes, whereas long baselines only measure modes larger 

than that baseline. 

Given the choice of spending more time integrating on one mode and divid- 
ing the observing time across multiple modes, you always win by integrating on 



one mode until you reach a signal-to-noise ratio of ~1 (Halverson 2004 1 . The 
thermal noise on the power spectrum decreases linearly with the integration time 



(Eq. 16), until the thermal noise is less than the signal at which point the noise 



is dominated by the sample variance (Eq. 17). In the low signal-to-noise regime, 
measuring two modes provides a \/2 improvement, whereas measuring one mode 
twice as long provides a factor of 2 improvement. DASI was designed to cancel 
the sky rotation so antenna pairs always saw the same angular mode, increasing 
the integration time r on that mode at the expense of measuring fewer modes. 
For HI interferometers it is impractical to cancel the sky rotation, but it is com- 
mon to have many redundant antenna pairs (same separation and orientation) 
simultaneously measuring the same mode, thus r for a particular mode can be 
greater than the total observing time. 

Even for a particular of interest, the experimenter can choose whether to 
focus the measurement in portions of the annulus more along the angular direction 
(A;_l) or more along the line-of-sight direction (/cy). For EoR machines the primary 
signal is from a largely isotropic P{k), and very short baselines provide a much 
higher signal-to-noise: short baselines measure all k, concentrating the array 
provides many more redundant baselines and much higher r on short baseline 
cells, and the foreground contamination is least in the k^\ direction (detailed in 



{ 3.4.1 ). For the first generation EoR arrays nearly all the cosmological sensitivity 
comes from baselines shorter than a few hundred meters, and longer baselines are 
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primarily for instrumental calibration and foreground determination. 

For 21 cm intensity mapping experiments the question is a bit more complex. 
Because the theory of structure formation is much more mature there is a com- 
mon set of cosmological parameters (^A) ^u, ^s, etc.). These parameters map 
into both the symmetric and asymmetric terms of the power spectrum in dif- 
ferent ways (Equation [5]) . The standard solution is to perform a Fisher matrix 
analysis for specific proposed instruments (e.g. Visbal, Loeb Sz Wyithe|[2008 1 to 
show the parameter constraints and degeneracies of the proposed observations. 
However, an HI instrument designer has some freedom over both the range of 
scales measured and their angular distribution — by varying the antenna layout 
the signal-to-noise distribution in the three dimensional fc-space can be tuned 



(Figure 10). fe-space Fisher plots, as shown in Figure 11 can help the designer 
of an intensity mapping instrument maximize the sensitivity to the parameters 
of interest, while helping break parameter degeneracies. 

So far we have discussed power spectrum sensitivity exclusively, but there are a 
number of science applications such as Stromgren sphere imaging, non-Gaussian 
EoR statistics, topology /non-stationary EoR statistics, and cross-correlation with 
smaller area optical surveys which benefit from imaging — a signal-to-noise >1 
per mode (corresponds to phase uncertainty on the mode less than ~1 radian). 
This is in direct tension with the power spectrum measurements — in the power 
spectrum you maximize sensitivity by integrating to S/N of ~ 1, then move to 
a new field (or redshift range) without ever achieving the sensitivity to create 
quality HI images. The boundary tends to be a bit fuzzy for HI interferometers 
as some A:-cells will hit the imaging limit before others, but the tension is real. 

The power spectrum sensitivity scaling also affects the decision to track the 
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sky vs. allowing it to drift overhead. In the low signal-to- noise regime, a tracking 

instrument is more efficient as it integrates longer on the same modes instead of 

splitting the observing time over independent modes in separate fields {l/^/Nk). 

However, non-tracking antennas are simpler and cheaper to construct allowing a 

larger collecting area at the same instrumental cost. Both tracking and drifting 

approaches are being pursued by first generation HI interferometers, with GMRT, 

LOFAR, MWA, and CARPE tracking the sky and PAPER, CHIME and CRT 

drift scanning. 

An example of the power spectrum sensitivity of first generation EoR and 
intensity mapping experiments is shown in Figure [9] The instrumental sensitivity 
has been calculated by determining the signal-to-noise ratio in each 3D /c-cell. 



then performing a weighted average within the spherical fc-shells (Figure 10) to 
produce the one dimensional power spectrum. For historical reasons there 
are more EoR sensitivity calculations for the MWA than other instruments, but 
all of the first generation machines are very similar with the limits usually just 
scaling by a factor of a couple. For both EoR and post-reionization instruments 
the sensitivity increases (lower limits) towards smaller \k\ until the foregrounds 
dominate ( ^3.4[ ). During the epoch of reionization the signal evolves rapidly with 
decreasing redshift (compare to the red dotted fully neutral PS, and Figure [s]), 
while the instrumental sensitivity slowly increases as the sky temperature falls. 
During the peak of reionization the first generation EoR instruments can hope to 
see about a decade of power spectrum between the foreground limit at low \ k\ and 
the sensitivity limit at high \ k\. The increasing sky temperature makes detections 
of the 21 cm power spectrum beyond a redshift of ~10 very challenging for first 
generation machines even if the evolving signal is near maximum. The majority 
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of the information about reionization from the first generation EoR observations 



will come from the evolution of the power spectrum amplitude and slope (Lidz 



et al. 2008 and Figure |4]). For the post- reionization instruments, much of the 
science case is driven by detecting the BAO signature which is plotted in the last 
two panels. While the BAO signal is much weaker than the reionization signal, 
the intensity mapping machines are helped by the much lower sky temperature 
at these redshifts. 

The first generation EoR and intensity mapping experiments will also be able 
to image the largest scale structures. In Figure|9]the sample variance contribution 
to the power spectrum is plotted as a dashed line, and starts to limit the sensi- 
tivity at the largest scales. While from a power spectrum perspective this means 
one should start observing additional fields, it means one can start to image at 
the scales where the sample variance is limiting. Figure |6] shows the imaging 
capability of a large quasar Stromgren sphere for a first generation EoR machine. 
It should be noted that imaging sensitivity should be calculated on a per-mode 
basis just as the power spectrum is — the radiometer equation assumes a uniform 
antenna distribution and is dramatically wrong for the very centrally condensed 



antenna distributions of most 21 cm fluctuation machines ( Geil Sz Wyithe|[2008 1 . 



3.3 Interferometric calibration 

All of the HI experiments feature compact antenna configurations and high survey 
speeds to maximize the sensitivity relations detailed in the previous section. As 



we will detail in ^ 3.4 , successful separation of the bright astronomical foregrounds 
from the very faint EoR and BAO signals will require extraordinarily precise 
instrumental calibration. EoR instruments are pushing the state of the art in the 
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calibration of radio instruments (Mitchell et al. 2008, Pen et al. 2008a Nijboer 



k Noordam][2007l [Parsons k Backer|[2009l ) , and it is in calibration that the craft 



of instrument building comes to the fore. 

Traditional calibration determines a single per-antenna gain and phase using 
observations of a bright isolated point source. This is usually performed using the 
selfcal algorithm (detailed in Taylor, Carilli k Perley||l999 ) which assumes all of 
the flux comes from the dominant source and adjusts the complex antenna gains 
to maximize this flux, solving for a single complex gain per antenna (amplitude 
and time delay) . This single complex gain calibration works well in the limit of all 
antennas being identical and the atmospheric propagation delay being constant 
across the field-of-view. For an instrument like the VLA (Very Large Array) 
the antennas are regularly adjusted to keep their figures close to identical and 
the narrow field of view naturally limits angular variation in the atmospheric 
propagation delays. 

While gain and time delays generated after the antenna couples the incident 
radiation into the RF cable (cable losses, temperature dependent amplifier gain 
and filter characteristics, mixer phase delays, etc.) do affect all of the incoming 
electric signal equally, effects which occur within the optics of the antenna can im- 
part direction-dependent amplitude and delay changes. The direction-dependent 
gain can be envisioned as a envelope which attenuates the true incident electric 
field, or more physically it can be represented in the Fourier plane as the effi- 
ciency (and delay) of radiation hitting each segment of the antenna of coupling 
into the RF cable. The Fourier representation is more commonly known as the 



holographic antenna map (Scott k Ryle 19771, and is typically produced by us- 
ing a reference antenna to look at a very bright astrophysical source while the 



Cosmic 21 cm Fluctuations 63 
antenna under calibration is raster scanned so the apparent amplitude and phase 

of the source can be measured in all look directions. This gain map is Fourier 
transformed to produce an image of the antenna surface. Holographic antenna 
maps reveal important physical constraints on the antenna calibration problem: 
only electric field hitting the surface of the antenna can couple in so the gain can- 
not vary with direction faster than X/D (wavelength and antenna diameter) 



13 



and features smaller than ~ A/2 are averaged over by the spatial coherence of 
the electric field {6 < 7r/2, aim wave does not see a 1 cm bolt). This provides 
an upper limit of ~4A/A^ on the number of parameters needed to describe the 
single frequency calibration of an antenna — one complex number for each 
patch of the antenna surface. In practice the number of calibration parameters 
may be significantly less if there are typical modes of deformation (e.g. gravity 
sag of a dish, dipole based gain factors), but these are best conceptualized as 
additional constraints on the characteristics of the antenna reception surface. 

The scales of the atmospheric/ionospheric phase delays depend on the am- 
plitude and inner and outer scales of the turbulent Kolmogorov cascade, plus 
additional modes such as large scale Traveling Ionospheric Disturbances (TIDs). 
There are three different regimes of atmospheric distortions which concern HI 



interferometry (following Lonsdale 2005, starting with his regime 2). In regime 
2, the antenna baselines are long compared to the ionospheric scales, but the 
field-of-view is small so the length scale at ionospheric heights is small compared 
to the minimum distortions. In this regime sources scintillate, but the scintil- 
lation pattern is the same for all sources in the field and it can be calibrated 
with a single ionospheric time delay per antenna. Regime 2 is commonly en- 



^^reflections from outside the antenna, curved sky effects, and Fresnel diffraction of the an- 
tenna's three dimensional structure can produce small contributions from outside the antenna 



diameter ( [Morales fc Matejek|2009 1. 
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countered with VLA observations and is well corrected by the traditional selfcal 

algorithm. In regime 3 the antenna baselines are shorter than the ionospheric 
distortion scales, but the field of view is large so the apparent source positions are 
refracted in different amounts across the field but do not scintillate. This can be 
corrected with a 'rubber sheet' correction to map sources from their observed lo- 
cation back to their true location, and is the typical regime for EoR observations. 
Regime 4 is the most challenging situation, where both the antenna baselines and 
field of view are large compare to ionospheric distortions, leading to a direction- 
dependent scintillation of sources. The correction of these regime 4 distortions 
will be needed for high resolution imaging by LOFAR and the LWA (Long Wave- 



length Array) and are at the forefront of ionospheric calibration research ( van der 



Tol, Jeffs k van der Veen 2007). 



Mathematically correcting interferometric radio data is identical to optical/IR 
adaptive optics, with regimes 2 and 4 corresponding to narrow and widefield 
adaptive optics respectively. The Fried length for ionospheric distortions (length 
over which phase delay is <C 27r, i.e. refraction but no scintillation) is typically 
several kilometers with timescales of minutes, but can vary by orders of mag- 
nitude depending on the ionospheric conditions and the frequency of observa- 
tion. Of additional concern to HI cosmology observations is non-diagonal noise 
contributions from ionospheric distortions. While the noise of individual mode 
measurements (visibilities) is usually independent, the stretching and compress- 
ing of ionospheric distortions can create the spatial analog of intermodulation 



distortion, mixing spatial modes before they are measured by the array (Morales 



Sz Matejek 20091. This introduces non-diagonal correlations into the visibility 



correlation matrices, and it is unclear what effect this may have on extracting 
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cosmological signals. 

The direction-dependent antenna gain and the ionospheric distortions can be 
corrected if enough cahbrators can be observed to determine the cahbration pa- 
rameters. Because the radiation wavelength is on the meter scale, it is difficult to 
construct an anechoic chamber large enough to get into the antenna far field, and 
no one has succeeded in designing a precision balloon, satellite, or tower based 
calibration system (any conductive support or wire is refiective, and it must raster 
through the antenna field-of-view) . For these reasons all of the first generation 
EoR instruments are relying on compact astrophysical objects as their calibration 
sources. The difficulty with astrophysical calibrators is source isolation — when 
the emission from one source is selected it contains some emission from sources 
across the field due to the finite point spread function (Figure [7] panel c) . The 
cross-leakage of calibrator sources adds systematic errors that limit the precision 
to which the calibration parameters can be solved for. 

The four EoR observatories have taken different approaches to calibration, and 
are all pushing the state-of-the-art in radio calibration. Briefiy the approaches 



are: 



GMRT is using a novel pulsar based calibration approach ( Pen et al.|2008a |. 



Using a customized correlator they separate visibilities from when the pulsar 
is on from when it is off. By subtracting the on and off data they are left 
with only the pulsar emission, beautifully isolating the emission of their 
calibrator from all other sources. Unfortunately the space density of bright 
pulsars means there is only one calibrator in their field of view. This is 
less of a problem for the GMRT as it has the smallest field of view of the 
EoR machines, but it restricts the ability to perform direction-dependent 
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calibrations. 



PAPER calibration relies on a very stable antenna gain pattern (Parsons 



Sz Backer 2009). The PAPER antenna is designed to have a predictable 
gain response, and as it only looks straight up the response is very stable. 
Effectively this reduces the number of calibration parameters that need to 
be solved for. 

MWA calibration simultaneously fits the gain for ~100 calibrator sources 
across the antenna field of view and sidelobes, effectively producing holo- 
graphic antenna maps and an ionospheric refraction map every 8 seconds 



( [Mitchell et al.||2008[ ). The 512 antennas of the MWA provide an excellent 
point spread function, improving the source isolation between calibrators 
at the cost of an enormous data rate. The data rate necessitates that the 
calibration be performed in real time, as the data volume is too large to 
archive for long observations. 

LOFAR has fewer 'virtual' antennas (stations) than the MWA, and can 
thus afford to save all of the visibility data for offline processing. However, 
this reduces the source isolation. LOFAR calibration uses a full maximum 
likelihood solution of the calibration parameters coupled with their EoR 
foreground subtraction and a three-dimensional ionospheric model to work 



around the calibration leakage issue (Labropoulos et al. 2009 Nijboer Sz 
Noordam|[2007l ). 



So far the intensity mapping instrument designs have concentrated on achieving 
sufficient signal-to-noise, but we expect calibration to become a focus as the 
designs mature. In this review we have concentrated on measuring the calibration 
parameters, but how to use the calibration in the data analysis pipeline should 
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also be considered. The interested reader is referred to the overview by Rau et al. 



( 2009 1 and hterature on the analysis of direction-dependent calibrations through 



either image faceting ( |Schwab|1984l [Taylor, Carilh k Perley|1999p or more recent 



software holography techniques ( Bhatnagar et al.|2008 Morales Sz Matejek|2009 1 



3.4 Foreground removal 

The primary challenge facing HI measurements is foreground subtraction. Astro- 
physical sources such as Galactic synchrotron radiation and extra-galactic radio 
galaxies provide a sea of foreground emission ~5 orders-of-magnitude brighter 
than the EoR and post reionization redshifted 21 cm fluctuation signals. In 
addition there are strong terrestrial and satellite broadcasts and a number of 
instrumental effects. These unwanted contaminants must be removed to high 
precision to detect the cosmological HI signals. 

While almost all the literature concentrates on the EoR, it can be translated 
directly to higher frequency BAO observations. Most of the foreground work 
remains theoretical, and we expect the techniques will change dramatically over 
the next couple of years as they are confronted with real data. We are just 
starting to see the first deep integrations at EoR frequencies by Ali, Bharadwaj 



k Chengalur (20081, Pen et al. (2008a) and Schnitzeler, Katgert k de Bruyn 



(20091, and these early observations are already having a strong impact. 

The early foreground subtraction literature concentrated on astrophysical fore- 
grounds that might produce a fundamental barrier to HI observations. In the 
past few years research has shifted to observational effects such as chromatic 
point spread functions and imperfect polarization purity, and how they compli- 
cate the removal of the astrophysical foregrounds. We will start with a brief 
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review of astrophysical foregrounds in ^3.4.1 then will concentrate on the more 



recent observational foreground issues in ^3.4.2 



3.4.1 Astrophysical FOREGROUNDS The major astrophysical foregrounds 
are synchrotron and free-free emission from our Galaxy and a confused sea of 
extra-galactic radio sources. The angular fluctuations in the Galactic emission at 



high latitude are quite small (de Oliveira-Costa et al. 20081 and largely resolve 



out in interferometric observations However, at arcminute scales extra-galactic 
radio galaxies produce a confused and highly structured surface of emission that 
is only a few times fainter than the Galactic emission. 

The Galactic synchrotron emission has a typical spectral index of Z"^'^, and 



de Oliveira-Costa et al. (20081 has compiled low frequency observations into 



a comprehensive model of the large scale galactic intensity emission (Figure 
[s]). While the Galactic intensity is spatially very smooth, it is also polarized 
and Faraday rotation introduces significant structure in polarized maps at ar- 



cminute scales. Observation by de Bruyn et al. (2006) and Schnitzeler, Katgert Sz 



de Bruyn (20091 with the Westerbork Synthesis Radio Telescope (WSRT) at 315- 



388 MHz (redshift 5.5-4.6) have shown ~3 K polarized structure on arcminute 
scales and ~7 K fluctuations at the 150 MHz band targeted by EoR observations 
( Bernardi et al.|[2009 ). However, most of the published WSRT observations have 
focused on the Fan region which is known to have significant polarization struc- 



ture. Observations by Pen et al. (2008b I with the GMRT in a different 150 MHz 



window show significantly less polarized emission. From current observations it is 
unclear how much polarized structure there will be in the windows of interest for 



^*There are no zero-spacing measurements for technical reasons, so the fc-space (5-function 
at zero (mean intensity) is not measured. However, the unresolved Galactic emission can still 



dominate the system temperature [3.1 
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EoR measurements, though much larger surveys of polarized foreground emission 

will be released soon (Gianni et al. personal communication). 

The extragalactic foreground emission ranges from extremely bright radio galax- 
ies (e.g. Gas A) down to a confused sea of faint sources. On the faint end there 
are hundreds of sources per angular pixel — all pixels contain foreground emis- 
sion. Our expectations for the extragalactic emission come primarily through 



increasingly detailed simulations by Di Matteo, Giardi Sz Miniati (2004 1, Bow- 



man, Morales Sz Hewitt (20091, and in particular the comprehensive models by 



Jelic et al. ( 2008 1 . The confusion level foreground has additional spectral features 



due to the co-addition of many sources along the same line-of-sight with different 
spectral indices and curvature points, though the luminosity distribution is flat 
enough that the emission tends to be dominated by the brightest source in each 
angular pixel. 



Di Matteo et al. (20021 and Oh & Mack (20031 showed that the clustering 



and fluctuations of radio galaxies presents an insurmountable obstacle to angular 
HI power spectrum measurements. The key to foreground subtraction is the 



spectral smoothness of the synchrotron and free-free emission ( Di Matteo, Giardi 



k Miniati 


2004 McQuinn et al. 


2007 


Gnedin & Shaver||2004 


Santos, Gooray &: 



Knox 2005 Zaldarriaga, Furlanetto k Hernquist 2004 Morales k Hewitt 2004 1. 



The fluctuations in the HI density and spin temperature along the line-of-sight 
direction (redshift) create small spectral fluctuations which are superimposed on 
the smooth synchrotron and free-free foreground spectrum. First generation HI 
instruments can be thought of as looking for small spectral fluctuations due to 
changes in the HI density along the line-of-sight. While the foreground emission is 
not perfectly flat due to synchrotron self-absoption, the admixture of sources with 
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different spectral indices, and otlier effects, all of the foreground spectral features 

are very smooth. A number of techniques have been developed for fitting out the 

slowly varying foreground emission along each line of sight, including polynomials 



(McQuinn et al. 2007 Jelic et al. 20081, double log polynomials (Wang et al. 



2006), and non-parametric techniques ( Harker et al.|[2009 |. All of these methods 



work well, and qualitatively, fit out the first few line-of-sight Fourier modes where 
the foreground dominates, leaving the more rapid line-of-sight HI fluctuations. 
It is instructive to look at the foregrounds in the three dimensional fc-space of 



Figure 10 The Galactic emission is very smooth both spatially and spectrally, 
and only adds power at very small \k\. The extra-galactic emission has a lot of 
angular k± power due to fluctuations in the number of galaxies per angular pixel 
and galaxy clustering, but their spectra are also very smooth and fall rapidly 
after a couple of modes in . Qualitatively the foreground removal techniques 
remove the first ~3 A;|| bins along the line of sight, leaving the higher /cy bins 
largely free of the foreground contamination. 

In addition, the functional forms of the foregrounds and expected signals dif- 
fer: the cosmological signal is approximately spherical in shape due to the com- 
bination of spatial isotropy and a redshifted emission line ( ^3.2[ ), whereas the 
foregrounds have a separable-axial symmetry (~cylindrical) because there is no 



relationship between the angular and line-of-sight /frequency emission (Morales 
k Hewitt|[2004l [Zaldarriaga, Furlanetto Hernquist]|2004[ ) . 



The difference in the three dimensional /c-space symmetries provides an im- 
portant handle for distinguishing foregrounds from the HI signal. The only fore- 
grounds which can mimic the symmetry of the HI signal are radio lines at high 
redshift. There was concern that redshifted radio recombination lines could pro- 
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vide just such a confusing foreground, but more recent analysis predicts this 



will be a very small contribution (Furlanetto, Oh &; Briggs 20061. Symmetry- 



based subtraction algorithms can further reduce foreground contamination and 



determine the systematic errors due to foreground subtraction residuals ( Morales 



Bowman Sz Hewitt 2006). 

It is possible to partially recover the angular modes (small component) on 
large scales where the measurement pushes into the SN > 1 per mode imaging 



regime. Simulations by Geil et al. (2008) and Furlanetto, Oh Sz Briggs (20061 



show that large ionized Stromgren spheres around quasars can be imaged in the 
presence of strong foregrounds, though foreground removal does decrease the 
contrast of the image (Figure [6]) . 

The general consensus is that there are no known astrophysical foregrounds 
which present a fundamental impediment to HI power spectrum or imaging ob- 
servations. However, no instrument is perfect and removing astrophysical fore- 
grounds from real world observations remains the primary challenge of both EoR 
and intensity mapping measurements. In the literature the discussion of fore- 
ground removal has shifted from the fundamental astrophysical foregrounds dis- 
cussed in this section, to the removal of foregrounds from upcoming observations 
which we discuss next. 

3.4.2 Observational foregrounds While astrophysical foregrounds can 
be removed in principle using the smooth spectral properties, small observa- 
tional contaminants can mix bright astrophysical and terrestrial foregrounds into 
the spectral domain and mask the faint HI spectral fluctuations. Observational 
contaminants include ionospheric refraction and scintillation, non-linearity in 
the analog RF system, chromatic PSF and FOV, and calibration errors in the 
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direction-dependent gain and polarization response of the antennas. These errors 

can combine with astrophysical and terrestrial foregrounds to create second-order 

contaminants — an effect commonly called mode mixing. 

As an example of mode mixing, let's consider a mis-subtracted point source 

(e.g. a radio galaxy) interacting with the chromatic instrumental PSF as shown 



in Figure 12 The PSF of all HI interferometers is inherently chromatic — the 
apparent separation of the antennas in wavelengths changes as a function of 
frequency (Figure [7] panels c k, d). If a source is mis-subtracted, either because 
the flux is not known precisely, the instrumental gain of all the antennas towards 
the source is not known accurately (error in knowledge of the angular PSF), or 
the frequency dependence of the antenna gains is not known accurately (error in 
frequency dependent changes of the PSF), there will be residuals in the image 
cube proportional to the product of the errors in the source flux and the PSF or 
the product of the source intensity and the errors in the instrumental calibration. 
Because the angular position of these contaminants changes as a function of 
frequency, they introduce spectral fluctuations along lines-of-sight away from the 
source location. Effectively this has mixed a purely spatial foreground — a smooth 
spectrum point source — into the frequency domain, complicating the foreground 
subtraction techniques discussed in the previous section. 

The experimental emphasis on instrumental and atmospheric calibration ( ^3.3[ ) 
is an effort to mitigate these mode mixing contributions to the foreground sub- 
traction problem. Observational foregrounds are characteristically a product be- 
tween an instrumental term and an astrophysical or terrestrial foreground. The 
product of our calibration uncertainties and our foreground uncertainties must 
be less than the expected EoR signal, or better than a part in 10^ for most mode 
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mixing terms. While mitigation of observational artifacts has been a significant 

design driver for EoR and intensity mapping observatories, much of this work is 
only now working its way into the literature (they're busy building instruments). 
In this section we will briefly review the main sources of observational foreground 
contamination and the techniques being used to reduce their effect. 

Ionospheric calibration. Phase delays in the ionosphere lead to refractive dis- 
placements (and at times scintillations) in the apparent location of bright sources. 
Errors in the ionospheric calibration can lead to the source subtraction being 
slightly offset from the apparent source location. This leaves residual flux in the 
image cube with a characteristic chromatic PSF pattern, mixing the angular error 
in the source location into the frequency domain. As ionospheric displacement 
scales as A^, this is a larger problem for EoR observatories than post-reionization 
arrays. All of the EoR arrays are located at mid geomagnetic latitudes (between 
the magnetic pole and equator) where the ionosphere is better behaved. The low 
PSF sidelobes of the MWA reduce both the contamination from mis-subtractions 



and the systematic uncertainties in the apparent source locations (Mitchell et al. 



2008), while LOFAR uses its long baselines to create a full 3D model of the 



ionosphere (Intema et al. 20091. 

It has been suggested that as the ionospheric fluctuations are statistical, they 
could be ignored and this effect would average out (effectively the foreground 
sources would all be smeared in a similar way, Zaldarriaga & Tegmark personal 
communication). However, we have very little data on the statistical distribution 
of ionospheric distortions and it is unclear if they will average out sufficiently the 
over course of proposed EoR observations. 

RFI. Radio frequency interference from terrestrial and satellite transmitters 
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can be extraordinarily bright — a one Jansky astronomical source is only 10-26 W/m^ 

Most transmissions are characteristically narrow in frequency and can be excised 



in the frequency domain However, non-linearities in the analog receiving sys- 
tem introduce frequency mixing (intermodulation distortion), producing signals 
at the sum and difference of the RFI frequencies. Recursive mixing of the RFI and 
the mixing products can create a forrest of faint frequency contaminants. Sev- 
eral projects have selected remote areas of Western Australia (PAPER, MWA, 
CARPE) or the Sahara (CRT) to reduce the amphtude of the RFI, while LOFAR 



and the GMRT have concentrated on advanced RFI subtraction algorithms (Pen 
et al.||2008aD , very linear front ends, and higher ADC bit depths. 



Astronomical point sources. The flux of smooth spectrum astrophysical sources 



can mix into the frequency domain as described earlier (Figure 12). For bright 
astronomical sources the source flux and spectrum can be well determined, and 
most of the mixing comes from errors in the instrumental calibration of the 
antennas. In particular, the direction-dependent gain calibration of each antenna 
must be well characterized so the PSF towards each bright source is well known. 

For the confusion level sources it is much harder to determine the true source 
fluxes, as the observed brightness in an angular pixel is a combination of the 
true source intensity and the sum all other sources times the array PSF. In 
this case it is the uncertainty in the source fluxes which dominates the mode 



mixing. Bowman, Morales Sz Hewitt (2009) and Liu, Tegmark Sz Zaldarriaga 



(2008) have looked at this contamination for image and uu-plane foreground 
subtraction approaches respectively. Particular care must be taken to ensure 
that the algorithm fitting the smooth frequency emission of the confusion level 



^^Digital transmissions do not have a dominant carrier wave and transmit much more informa- 
tion, thus have a broader frequency signature and fill much more of their frequency allocation. 
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sources is not biased by the chromatic PSF contributions. Figure 13 shows the 
residual /c-space contributions from the image space foreground fitting. While 
these algorithms are specific to the MWA and its excellent baseline coverage, 
these studies suggest that the chromatic PSF and confusion level source mode- 
mixing can be removed to sufficient accuracy. 

Galactic polarization. While the EoR and post-reionization cosmology sig- 
nals are unpolarized, the polarized Galactic emission can leak into the unpolar- 
ized intensity measurement through instrumental polarization calibration errors. 
Faraday rotation by the interstellar magnetic field creates a /^^ rotation of the 



observed polarized Galactic synchrotron emission (0 oc |de Bruyn et al.|2006| 



Schnitzeler, Katgert & de Bruyn 2009 Bernardi et al. 2009 Pen et al. 2008a I . 



An enhancement in the instrument response at a particular polarization angle 
will create structure in the frequency direction as the rotating polarization angle 
beats with the polarized instrument response. Instrumental calibration errors 
mix the Stokes I, Q, U, SzV maps, and the Galactic synchrotron emission has a 
lot of spatial and frequency structure in Q &; U. The polarized Galactic emission 
is a few K on arcminute scales ( ^3.4.1 ), and there are often multiple rotation 
measures along each line-of-sight providing emission components which rotate at 
different rates and interfere to create additional frequency structure. 

In addition to precision polarization calibration, it is possible to use the charac- 
teristic dependence to separate the polarized Galactic emission from the EoR 
signal. By going to rotation measure space — Fourier transform of A^ — much of 
the polarized Galactic emission will resolve into components at specific rotation 
measures (Gaensler, personal communication). These contaminants can then be 
subtracted, as the rotation measure space is largely orthogonal to kii . 
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The EoR observatories are at the forefront of radio cahbration, and mode- 
mixing is Hkely to determine the success of the first generation of HI measure- 
ments. If it were not for the presence of strong foreground emission, the cahbra- 
tion requirements of HI machines would be rather modest. However, the mixing 
of the foreground emission into the frequency domain via instrumental effects 
places stringent limits on the calibration of first generation 21 cm fluctuation 
observatories. 
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Figure 8: A plot of the synchrotron emission from de OHveira-Costa et al. (2008) 



on a logarithmic relative amplitude scale (amplitude depends on frequency). The 
brightness of the of the galactic plane is easily seen, as are large scale features 
such as the North Polar Spur and extremely bright extra-galactic sources such as 
Cassiopeia A. Oliveira-Costa's model can be scaled to any of the frequencies of 
HI cosmology and includes dust emission and spectral curvature effects. 
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Figure 9: Comparison of spherically averaged 21 cm power spectrum and array 
sensitivity at redshifts between z = 9.5 and z = 1. The thick blue curves show a 
model for the 21 cm power spectrum signal ( Wyithe et al.|[2"009 during reioniza- 
tion and the linear mass power spectrum with xhi = 0.01 populating dark matter 
halos of M > IO^^Mq for z = 1 & 2), while the dotted red hne shows the 21 cm PS 
assuming a fully neutral IGM as a reference. At z > 6 the sensitivity is estimated 
over 1 field and within a 32 MHz bandpass for the MWA (dark solid lines) and 
for a future array with a 10-fold increase in the number of antennae tiles (dark 
dotted lines McQuinn et al.|[2006 |. At z = 1 & 2 the sensitivity is estimated over 
3 fields and within a 300 MHz bandpass for CARPE (dark solid and dashed lines 
show overall sensitivity and the cosmic variance limit respectively.) The sensi- 
tivity is plotted within bins of width Ak = fc/10 assuming 1000 hr integration in 
both cases. The sharp upturn at low k is due to the assumption that foreground 
removal prevents measurement of the power spectrum at scales corresponding to 
a bandpass larger than 1/4 of the total observed bandpass. To illustrate the ad- 
ditional sensitivity required for dark energy experiments, the baryonic oscillation 
component of the power spectrum is plotted at z = 1 & 2 (thin blue curves). 
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Figure 10: Illustration of the A;-space measurement space of an HI interferometer. 
The instrument measures the intensity in fc-space cells, with the power spectrum 
intensity and uncertainty per cell described by Equations [151 [16) & [TTj For 



EoR measurements the power spectrum intensity is averaged within |A;| annuli 
(curved regions). The maximum k±_ is set by the longest antenna baseline, and 
the perpendicular width of the cells is 1 /FOV. The maximum fcy is the inverse 
of the frequency resolution, and is typically much larger than the maximum 
angular wavenumber. The line-of-sight length of the cells is set by the inverse 
of the bandwidth for intensity mapping instruments, and for EoR observations 
bandwidth is further limited by the Az ~ 0.5 cosmic evolution limit. Lastly, 



foreground contamination removes the first few A;|| modes (^3.4). 
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Figure 11: These plots show examples of A:-space Fisher plots for HI observations 
at a redshift of 1.9 (Eli Visbal, personal communication). Each panel shows 
the change in the power spectrum field due to a change in one parameter — 
dP2i/dw in the upper panel and dP2i/dil\ in the lower in mK^. If changes in 
these parameters were fully degenerate the shapes of the associated changes in 
the power spectrum fields would be the same. In this case the dw plot is much 
more symmetric (the BAO ridges appear rectangular due to the log-log axes, see 
Figure 13) than the dO,\ plot where there are additional features in the line-of- 



sight direction. These plots show where an experimenter should concentrate their 
/c-space measurements to maximize the sensitivity to parameters of interest and 
minimize degeneracies. 
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Figure 12: This cartoon illustrates mode-mixing between a frequency dependent 
point spread function (PSF) and a mis-subtracted point source. At the origin 
of the horizontal axis a source has been mis-subtracted, and the residual flux 
ripples across the image due to the array's point spread function. Since the PSF 
scales with frequency (center frequency times fractional bandwidth, lowest fre- 
quency at bottom and highest at top) the positions of the radial intensity peaks 
change with frequency resulting in diagonal bands of spurious signal. At lines 
of sight away from the mis-subtracted source this leads to spectral fluctuations 
in the line-of-sight, as indicated by the contribution along the vertical dashed 
lines. The frequency dependent PSF is inherent in the measurement, and mixes 
a pure spatial component (mis-subtracted sources or mis-calibrated antenna re- 
sponse) into the frequency direction. In addition, as the frequency slope of the 
contamination depends on the distance to the residual sources, a wide range of 
line-of-sight /c-modes are contaminated. 
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Figure 13: Two-dimensional power spectrum of postsubtraction residuals for 



confusion level sources including chromatic PSF mode-mixing (from Bowman, 



Morales &i Hewitt 20091. For the left column of panels, the thermal noise has 
been artificially removed. It is evident that the foreground-only residuals follow- 
ing subtraction are much lower than the 21 cm signal in the range A;_l < 3 x 10^^, 
beyond which corresponds to when the uv coverage of the MWA becomes sparse. 
Panel (c) confirms that the 21 cm signal dominates the recovered power from the 
full-sky model since it appears nearly identical to panel (b). The white arcs in 
panel (f) illustrate the spherical shells of constant \k\ that are used to determine 
the spherically averaged one-dimensional power spectrum. Note it is only after 
averaging in A:-shells that the PS detection becomes possible. 



84 Cosmic 21 cm Fluctuations 

4 Conclusions 

The study of reionization and cosmology stands on the verge of a new era. In 
the next couple of years we will see the first sensitive attempts to measure 21 cm 
fluctuations from the EoR, and the start of construction of the first intensity 
mapping instruments. Progress towards understanding 21 cm fluctuations has 
been proceeding rapidly along parallel theoretical and observational paths. 

On the theoretical side, computer simulations of reionization have progressively 
achieved larger volumes and dynamic range, and have now reached a point of ma- 
turity where reliable (though not yet precise) predictions can be made. These 
simulations (an example of which is shown in Figure [2]) describe a hydrogen reion- 
ization that started with HII regions forming around the first galaxies. These HII 
regions grow to surround groups of galaxies, with reionization completing once 
the regions overlap and occupy most of the cosmological volume, leaving only 
dense islands of neutral gas. The dominant remaining numerical issues are con- 
centrated around the transport of radiation from the sources through the IGM. 
Numerical simulations use a range of approximations to address this problem, 
but have not yet converged to a common solution. Such a convergence is a criti- 
cal milestone on the path towards simulations that will be used to quantitatively 
interpret future observations of 21 cm fluctuations. With this goal in mind the 



Cosmological Radiative Transfer Codes Comparison Project (Iliev et al. 20061 
has compared a number of different radiative transfer codes on a set of standard- 
ised scenarios, including a cosmological density field. Although some differences 
remain that can be traced to technical details of the differing approximations and 
algorithms, the results are encouraging with general agreement on the properties 
of the resulting ionization field. These converged simulations will provide the 



Cosmic 21 cm Fluctuations 85 
benchmark for more efficient analytic and semi-numerical models whose flexibil- 
ity allows the exploration of more extremes of parameter space. As the numerical 
calculation of ionization structure becomes more accurate for a given model of 
high redshift galaxies, attention will need to focus on the properties of the sources 
driving reionization. In particular, self consistent account of all contributions to 
the development of ionized structure, in addition to stellar ionization from galax- 
ies, will need to be considered to draw robust conclusions from upcoming 21 cm 
observations. 

Observationally, there are four EoR machines planning to start sensitive 21 cm 
power spectrum observations in the next two years, and there are several intensity 
mapping instruments under active development. While the bright astrophysical 
foregrounds should not be a fundamental impediment to HI observations, they 
place very tight constraints on the observational precision needed to keep the 
foregrounds from mixing into the faint 21 cm fluctuations. The 21 cm fluc- 
tuation machines are pushing the state-of-the-art in low frequency calibration, 
digital data processing, high dynamic range imaging, and widefield observations. 
The impending EoR measurements will teach the observational community how 
to perform precision cosmological measurements at low radio frequencies. This 
experience will be invaluable for both subsequent EoR measurements and first 
generation intensity mapping machines. 

The past decade has seen a maturing of cosmology into a precision science 
via the development of massive galaxy redshift surveys and high resolution CMB 
maps. Observations of 21 cm fluctuations promise to revolutionize our under- 
standing of early galaxy formation and provide a powerful new tool for the next 
generation of precision cosmology measurements. 
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